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THE CONDITIONS OF BELIEF IN ADVERTISING 


A. T. POFFENBERGER 
Columbia University 


A reader of the literature dealing with the Psychology of Ad- 
vertising finds numerous articles written upon the effectiveness 
of various mechanical devices, the attention and memory value 
of size of space and the position on the page, the influence of 
color, style of type and its arrangement, the effectiveness of 
repeating the advertisement, etc. The problem of frousing the 
confidence of the consumer in the article advertised, the condi- 
tions on which it depends, how belief in advertisements may be 
created and how it may be measured have been very lightly 
toucHed in experimental studies. The importance of these mat- 
ters is efiphasized in every textbook on advertising. The fact 
that the American people are each year induced to squander 
many millions of dollars in worthless securities through the 
medium of advertising in some form, and that warnings seem 
quite ineffective in protecting them, makes one curious about the 
basis of belief in advertising. It is not enough to say that the 
American people like to be fooled and that there is no scheme 
too wild to arouse the confidence of a large proportion of them. 
The advertiser should know that action is dependent upon belief 
and that belief in advertising depends upon conditions, some of 
which at least are under his control. 

Belief is indeed a complex mental state and it may depend at 
any time upon a great variety of factors, the most common of 
which are listed in textbooks of psychology and advertising. It 
is not the purpose of this report to repeat such lists but to cite 
some experimental evidence from the field of advertising to show 
the importance of a few of the conditions of belief for success in 
advertising. 


1. Belief is rarely the result of reasoning. One does not go \|Y 
through the processes of logic to establish his beliefs. If logic is 
used at all it is to justify a belief already established. A strik- 
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ing illustration of the separation of reasoning and belief is found 
in the case of the insane patient who firmly believed himself to 
be the son of a king, and yet whose reason was intact enough to 
enable him to solve complicated mathematical problems. 
7 The advertising of the New Gillette razor offered a good oppor- 
tunity for studying the relation between belief and reasoning.’ On 
May 16, 1921, the Gillette Razor Company announced “a new 
triumph of American inventive genius of startling interest to 
every man with a beard to shave.” The advertisements state 
that the “fulerum shoulder, overhanging cap and channeled 
guard” were the three innovations which made possible “for the 
first time in any razor, micrometric control of blade position.” A 
diagram showed “how the blade is biflexed between overhanging 
cap and fulcrum shoulder. It is flexed once into the inside curve 
of the cap. This is the minor flexure—the curve for easy gliding 
action and play of the wrist in shaving. It is flexed a second 
time—more sharply and in a shorter radius—by the grip of the 
overhanging cap the whole length of the fulcrum shoulder. This 
is the major flexure.” This arrangement provided an exactness 
of adjustment to 1-1000 of an inch. Advertisements containing 
the above information and well illustrated were given to fifty- 
seven men, college students and university graduates, together 
with a series of seven questions intended to test both their belief 
in the new razor and their understanding of it. The answers to 
these questions showed that all the students agreed that the new 
razor was better than the old one, and that they would rather pay 
$5.00 for the new one than $1.00 or $2.00 for the old one. In 
supyorting their belief they were allowed to consult the advertise- 
ment as much as they wished. They quoted the “fulcrum shoulder, 
overhanging cap and channeled guard,” which made possible 
“micrometrie control of blade position,” but not one of them could 
explain how the micrometric control was obtained or what ad- 
vantage there would be in having such micrometric control. They 
believed that the “channel guard” was an improvement although 
they could not tell why it was an improvement. As to the import- 
ance of major and minor flexures they were entirely ignorant. 
Five minutes examiyation of an enlarged diagram of the new razor 
improved their understanding of the»razor little or not at all. 
ere is -a belief effectivé for the purpose of the advertiser in 
&pite of the inability of the reader to support his belief with 
reason. This experiment is quoted not to show that the adver- 
tisement is poor, but merely to show that in advertising reason- 


'The study about to be described is part of a larger investigation con- 
ducted by David Goldstein and the writer, which.will be reported at a 
later date. 
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ing is 2 j Whether the space in the ad- 
vertisement devoted to argument might have been more profitably 
filled, is however an interesting question. ; 

2. It is not necessarily that which is true that is ee 
The fact that a statement in an advertisement is true will nof~~ 
guarantee belief on the part of the readers of it. The truth may , 
be too startling and surprising to be believed, and in some cases it Xx 
might be more effective to tell half the truth than the whole truth. 
Three advertisements were chosen for investigation in this con- 
nection, each of them presenting rather unusual information. In 
each case the truth of the statements made was vouched for by 
reziable individuals. Each of the advertisements was presented 
to 100 people, together with a series of statements about them. 
The individuals were to simply check those statements that most 
nearly represented their opinion about the advertisements. The 
statements were carefully prepared so as to avoid suggestion. The 
studies of the three advertisements will be reported separately: 

Taylor Trunk Advertisement. This advertisement showed the 
photograph of a huge elephant standing on a trunk. The picture 
was accompanied by a signed statement as to the genuineness of 
the photograph, and by statements to the effect that the trunk 
was taken from the regular stock. Statistical treatment of the 
replies of the persons tested showed that 38 percent of them 
doubted the truth of the statements made in the advertisement; 
24 per cent questioned the genuineness of the photograph; and 
21 percent believed that it would be impossible to construct a 
trunk strong enough to withstand such a weight. That is, in from , 
one-fifth to two-fifths of the persons tested the advertisement cre- 
ated a state of mind adverse to the purchase of the article. 


Stanley Vacuum Bottle Advertisement. This advertisement rep- 
resented a vacuum bottle falling from a high window, and de- 
scribed an incident in which a Stanley bottle had accidentally 
fallen from an eighth story window and suffered only a dent, 
which did not at all interfere with its usefulness. The replies to 
the questions submitted with this advertisement showed that 31 
percent doubted the truth of the statements, while 22 percent 
refused to believe that a vacuum bottle sufficiently strong to sur-” 
vive the fall could possibly be constructed. 

Edison Phonograph Advertisement. This advertisement pic- 
tured one of the well known “tone tests” demonstrations in Car- 
negie Hall, Pittsburg. It showed the stage occupied by a well 
known singer and an Edison phonograph, with the audience in the 
background. On the advertisement there were reproduced a 
number of clippings from the daily papers describing the remark- 
able demonstration, each clipping making the statement in one 
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form or another that no one in the audience of 2,600 people could 
distinguish between the voice of the singer and its reproduction. 
Replies from 100 people to eight questions about this advertise- 
ment may be summed up as follows: 77 percent doubted the 
truth of the statements contained in the advertisements. 73 per- 
cent believed that they could tell the difference between a real 
voice and its reproduction by any phonograph. 82 percent be- 
lieved that mechanical sounds would betray the phonograph in 
these “tone tests.” 49 percent believed that some form of trickery 
or deception was practiced. 50 percent believed that the artist 
intentionally imitated the phonograph. 44 percent believed that 
a specially constructed phonograph was used for the demonstra- 
tion rather than a stock instrument. 68 percent believed that no 
phonograph could successfully undergo such a test. 39 percent 
stated that an actual demonstration of the “tone test” such as 
that described in the advertisement would not convince them of 
their inability to distinguish between the human voice and its 
reproduction. : 


This advertisement seemed especially effective in failing to create 
belief in the minds of its readers, when over three-fourths of 
them doubted the truth of the statements. The writer recom- 
mended in this case a less extreme and less startling form of 


appeal which should not arouse the antagonism manifested in this 
experiment. 


Thus far the negative side of the question has been presented. 
If belief in an advertisement does not depend upon the truth of 
the statements made and does not depend upon the reasoning of 
the reader, on what does it depend? To state the matter simply, 
we may say that ideas which are present in the mind and are no 
interfered with by any opposing ideas will be believed. This is 
merely a bare statement of the law of suggestion and to compl 
with it in advertising, conflicting ideas should be prevented from 
entering the mind. There are many conditions on which such un- 
disturbed acceptance of ideas depends. Only three will be men- 
tioned here. 

1. The ideas aroused by an advertisement must not conflict too” 
sharply with the reader’s experience. Introspections volunteered 
by many of the 100 subjects who served in the experiments just 
described indicated that théir past experiences with trunks, va- 
cuum bottles and phonographs furnished conflicting ideas which 
the advertisements were not sufficiently powerful to overcome. 
This was especially true in the case of the phonograph advertise- 
ment where doubt was expressed in a large percentage of the 
cases. 


An experimental study of an advertising campaign that failed 
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true, underline the word “True.” 
underline the word “False.” 


showed clearly the need for complying with the conditions of 
belief. “Radior Products,” a series of toilet preparations were 
introduced into this country by an English firm. 
contained in the advertising may be illustrated by the following 
quotation taken from one of the advertisements: 
magic new word in the book of beauty. 
union of the finest complexion preparations with actual radium. 
Its content of radium works the miracle of nature. 
the skin, gives it the health to regain its youthfulness and loveli- 
The container of these preparations was represented as 
emanating rays which were very suggestive of bolts of lightning. 
these toilet preparations could not be sold—the campaign had 
failed. Why? One possible reason for the failure was that the 
suggestion to apply radium on the skin for toilet purposes con- 
flicted too sharply with people’s opinions about radium. 
to discover what people believed about radium, a questionaire was 
carefully prepared in the form of a True-False test and submitted 
to 400 people, comprising a well-to-do, well-educated group, a 
group of average intelligence and financial standing and a special 
group of persons working in “Beauty Parlors.” 
is reproduced below. 
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The appeal 


“Radior is the 
It means the triumphant 


It purifies 


In order 


The questionaire 
It was constructed so as to minimize the 
influence of suggestion and to correct for it when it did occur, as 
for instance in combining replies from question 2 and 7. 
tions 3, 5 and 10 were introduced as fillers and had no special 
significance. 


Ques- 


Read the first statement on this sheet. If you feel that it is 


8—Radium causes burns when it comes into 
contact with the skin 


If you feel that it is false, 
Then judge each of the other state- 


ments as true or false. If you do not know, guess at it. This 
sample statement is false and is marked as it should be: 

There are eight days in the week.....................0008 True False 
1—Radium is a deadly True False 
2—Radium is so costly that it cannot be 

used for commercial False 
38—Treatments with radium will make 

4—Radium is used in the treatment of —— 

such diseases as False 
5—Radium is a substance which gives off light......Tyyge. False 
6—Radium when applied in diluted form gives : 

a healthful stimulation to the skin Tare False 
7—Radium is now used in the manufacture 

of many products in common use True False 
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9—Toilet preparations containing radium are to 

be recommended for their beneficial effects False 
10—Certain forms of radium can be obtained =F, 

at little cost alse 
11—Radium preparations should not be used 

except on the advice of a physician True False 
12—Radium products are gaining popularity 

mild stimulants to health aad growth rue > 

Read the following question and answer it by undeflining “Yes” 

or “No”: 


If you should be told that there was a toilet preparation of high 
quality and moderate price and which contained a minute quantity 
of radium, would your knowledge of the effects of radium be 
such as to make you want to buy it?...........ciccccsssesseceeees Yes No 


Nothing need be said here concerning the difference in the 
reactions of the different groups to the questionaire, except that 
the most intelligent group had the strongest reaction against 
radium-containing products, and that the “Beauty Parlor” workers 
had the least antagonism. Inquiry among these people indicated 
that some of them had confused radium with the violet ray which 
was at the time a very popular form of treatment with them. The 
results of the study of the questionaire may be summarized as 
follows: 

Forty-eight per cent of the persons tested believed that radium 
is a deadly poison; 80 per cent believed that it causes burns when 
it comes in contact with the skin; 89 per cent believed that radium 
preparations should be used only upon the advice of a physician; 
71 per cent said that they would not buy any kind of toilet prepa- 
ration that they knew contained even a minute quantity of radium; 
90 per cent thought of radium as a substance used for the treat- 
ment of cancer. The whole experiment may be summed up in th 
statement that about 70 per cent of all the replies indicate 
opinions unfavorable to radium-containing products as toilet 
preparations. The basis of this unfavorable reaction could easily 
be traced to the people’s accumulated experiences of radium. 
Clippings of all articles dealing with radium and appearing in 
newspapers and magazines were collected for a short period. 
Practically all of them emphasized the harmful effects of radium 
and the dangers incurred in handling it, instead of any beneficial 
properties it might possess. 

This is a clear case of the inability of an advertising campaign 
to overcome the resistance established by experience. To create a 
favorable attitude toward radium-containing products. as. toilet 
preparations by an educational campaign eonducted in news 
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papers and magazines might conceivably have been possible. But 
it certainly would have been impracticable. 

The mass of experiences upon which beliefs rest is gradually 
accumulating and being modified by additions» As some one has 
said, “Confidence is at the same time stable and fragile.” It is 
stable because of the bulk of experiences on which it rests; it is 
fragile because it is subject to change from new experiences. A 
supposed bargain which is found to be not a bargain, a derogatory 


newspaper report about a store, a misprint in the advertised price 
of an article; any one of these may be the experience that will 
introduce a conflicting idea into the mind of a consumer and 


the conditions favorable for his belief. The advertiser, app 


user, a] 4 
he does to populations, cannot take account of the shiftings of 
individual experience and belief, but he must take account of the 


widespread and slowly accumulating experiences which may 


amount to powerful prejudices against his appeal. 


uthorita- 


2. Ideas that are to create belief must co 


ti This is a well-known law of suggestion. The 
hypnotist can do nothing without his air of authority and the 
subject’s recognition of it. We are accustomed to believe the 
statements made by a person in whom we have confidence, and to 
believe what is printed in a medium which we consider authorita- 
tive. Even if there is conflict with one’s own experience, he will 
someti 


person as a 


times accept the contrary experience of anoth« 
basis for belief if he have sufficient confidence in the « 


ther person. 
But even then the new experience may not be too conflicting. 
Advertisers have for years striven to develop an atmosphere of 
confidence and authority by all the devices at their command. 
The present experiment was intended to measure in a tentative 
fashion the degree of confidence which an advertisement can create 
in comparison with other forms of publication!’ I have compared 
the degree of belief or doubt aroused by the three advertisements 
previously described (namely Taylor Trunks, Stanley Vacuum 
Bottles and Edison Phonographs) with the belief aroused by 
essentially the same statements coming from a reputable journal. 
In order to make such a comparison, the facts stated in each 
advertisement were prepared in the form of a news item abstracted 
from an engineering magazine. One such abstract will illustrate 
the character of all. These abstracts were presented to a group 
of 100 persons of the same general character as those tested in 
the earlier experiments, but who knew nothing of those experi- 
ments or nothing of the purpose of the present experiment. Along 
with the abstract of each advertisement was a series of questions 


as nearly as possible like those used in the test with the advertise- 


ents. As far as we were able to ascertain, no one doubted the 
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authenticity of the news abstracts. Instead of reporting the re- 
sults of the three questionaires in detail, it will be sufficient to 
compare for each of the three cases the number of replies indicat- 
ing doubt in the advertisement and doubt in the news item. It 
must be remembered that the news item mentioned no trade-named 
article nor indicated any connection with specially advertised 
goods. 

The following is an extract from————————_Magazine of June, 
1921. Read it carefully and then answer the questions given be- 


low, by scratching out with a pencil the part of each that does 
not give your opinion. 


A REMARKABLE SHOCK RESISTING CONSTRUCTION 


“A new type of wardrobe trunk was recently put through a 
remarkable test of strength. The five-ton elephant belonging to 
the New York Hippodrome was made to stand on the empty trunk. 
An examination of the trunk following the test showed that it 
had withstood the great shock without damage. 

Interesting photographs were made of the test, showing the 
elephant mounting the trunk, seated upon it, and then stepping 
off of it.” 

1—I believe—doubt—do not believe—the statements made in the 
above extract. 

2—I feel that the photographs that portray the test are—are 
not—genuine. 

3—I do—do not—believe that a trunk could be made strong 
enough to successfully undergo the test described. 

Percentage of replies expressing 


doubt In the Ads In the News Items 
28 per cent 43 per cent 
Stanley Vacuum Bottles ................. 27 per cent 43 per cent 
Edison Phonographs. 55 per cent 40 per cent 


In two of the cases it will be noted that there was greater con- 
fidence in the advertisement than in the news clipping, while in 
the third there was greater confidence in the news clipping. An 
examination of the three.advertisements did not afford an entirely 
satisfactory explanation for the shift of belief in the case of the 
third advertisements (It seems safe to conclude from these records 
that although belief in certain advertisements may be low, they 
may carry at least as much authority in presenting a set of facts 
as can be conveyed by a news article. The doubt in the cases we 
have studied is the effect rather of conflict of ideas with experi- 
ence, than the effect of the use of an unauthoritative medium of 
expression. Introspections volunteered by the subjects suggest that 
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illustrations and especially photographs used in the advertise- 
ments tend to strengthen belief. Unless one suspects trickery, as 
some of our subjects did, looking at a picture ought to carry with 
it greater weight in establishing belief than merely reading 
printed matter. Even if Mark Twain was right in advising that 
one believe only half that he sees and nothing that he hears, the 
advantage in favor of the picture is obvious. 

3. There is a third important condition of belief, namely, that 


we tend_to lev hat arouses our desires, our fears and our 


en ally. I have no experimental evidence to offer in 
this connection, and know of none in the field of advertising. But 
evidence fer the importance of this factor may be drawn from 
psychology. Wm. James has said, “A man who has no belief in 
ghosts by daylight will temporarily believe in them when alone at 
midnight, he feels his blood curdle at a mysterious sound or vision, 
his heart thumping and his legs impelled to flee.” In strong emo- 
tion we might find the condition responsible for the belief in the 
bargain counter. The politician finds no difficulty in honestly be- 
lieving what best fits in with his aspirations, while his opponent 
may as honestly believe the opposite and for a like reason. If 
one really wants a certain suit of clothes or an automobile which 
costs more than he should pay, he may honestly believe that he 
is making an economical purchase. Once a belief has been estab- 
lished in this way, logic and reasoning may be used to support it. 

These three conditions of belief which I have described repre- 


sent three important variables in the formula for advertising suc- 


cess. For most advertising situations they are unknown variables 
which may interact in a very complex manner. But they may be 
discovered by careful examination. That they are not always 
sought or discovered, is clear from the cases I have cited which 
were taken from advertising already used. Three questions might 
well be asked about every advertisement: (1) What adverse be- 
liefs and experiences does it have to meet in the minds of con- 
sumers? (2) Will the authority which it wears by its mode of 
presentation or by the medium in which it appears enable it to 
create belief? (3) Does the appeal used arouse desires which will 
in turn create belief in the advertised article? These are human 
behavior questions that psychological methods will enable the ad- 


vertiser to answer before the advertising is used as well as after 


the money has been spent upon broadcasting it. 
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MINOR STUDIES IN THE PSYCHOLOGY OF 
ADVERTISING 


From the Psychological Laboratory of Indiana University 
By H. D. Kitson, C. W. BARNES, AND C. IUPPENLATZ 


IX. RIGHT AND LEFT HAND PAGES IN MAGAZINES 


A considerable amount of discussion has centered 
question as 


around the 
to which is the more favorable position for advertise- 
ments in magazines, the right or left-hand page. The question 
has usually been approached through the laboratory method, sult- 
jects viewing the momentarily exposed pages of a 


“dummy” or a 
real 


magagine and then writing down all they have seen in the 
order in which they have seen it. With this technic Adams’ re- 
rts a value of the right hand page which is to that of the left as 
190 is to 65. That is, he estimates the right*hand to be fifty per 
ipericr. Hotchkiss and Franken’ estimate from their inves- 
tigation that the right-hand page of a magazine is worth five per 
cent more than the left-hand page. 

In order to secure additional light upon the question the writers 
approached the problem with the historical technic, seeking to dis- 
cover the frequency with which advertisers actually used the right- 
hand and left-hand pages. 

The materials used were 1082 consecutive pages of advertising 
matter in the American Magazine and 1000 in the Saturday Eve- 
ning Post, for the year 1922. Beginning with the first advertise- 
ment following solid reading matter tabulation was made of the 
successive pages containing advertising matter, exclusive of cover 
pages. Results are shown in Table I. 


TABLE I. 


SHOWING THE FREQUENCY WITH WHICH RIGHT AND LEFT-HAND 
VAGES CARRY ADVERTISEMENTS IN THE AMERICAN 
MAGAZINE AND THE SATURDAY EVENING 
POST FOR 

Total Total Total Full Full Full 
Right Left ~ Total Right Left Total 
No. % No. '% No. -% No. -% No. 
Miebee 576 56 506 44 1082 . 301 82 64 18 365 
S. E. Pu. 521 52 481 48 1000 4350 67 171 33 521 


111. EF. Adams, Advertising and its Mental Laws, New York, Macmillan, 
1916. p. 100. 

2G. B. Hotchkiss and R. B. Franken, Attention Value of Advertisements, 
New York, New York University, 1920, p. 24. 
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These figures show that in both magazines the right-hand page 
is used for advertisements more frequently than the left; in the 
American 56 per cent of the times and in the Saturday Evening 
Post 52 per cent of the times; with a frequency on the aver- 
age 17 per cent greater than that for the left-hand page. 

Still stronger evidence of the favor in which the right-hand page 

held comes from facts shown in a separate tabulation which 
was made concerning the location of full page advertisements. 

here were 365 of these in the American Magazine and 521 in 


Saturday Evening Post. Eighty-two per cent of the form 


1 67 per cent of the latter appeared on the right-hand page. In 
first named periodical the right-hand page was used for full 
advertisements five times as frequently as the left 1 ji 
Saturday Evening Post twice as frequently. It was <« \ 
certain advertisers who consistently used the same relat 


in issue after issue almost invariably used the 1 


On the asstmption of the historical method, that a practi 
ich in vogue has probably proved itself profitable, it would ap 
pear from thése results that the right-hand page in magazines is 
profitable than the left. Trte, the practice here revealed 
eed not necessarily be considered as having been based upon 
nseious reckoning of comparative profits and losses from right 
and left-hand insertions. Even at best it probably represents only 
the fruits of a “sub-conscious” “hunch.” Nevertheless it is com- 
ing to be recognized that such a process of summing up experience 
has some validity as a guide in business practice where scien- 
tifically valid measures are not yet obtainable. 

Appearance on the right or left-hand page of a magazine should, 
however, not be considered as wholly a result of the voluntary 
choice of the advertiser. Requirements of journalistic make-up 
would inevitably put some advertisements on the right and’ some 
on the left-hand page. Again, the requirements of color process 
or of some other technical matter might force certain advertise- 
ments on one page rather than another. Moreover, some adver- 
tisers do not specify either page, not regarding the matter as im- 
portant enough to bother with. For these reasons the more fre- 
quent use of the right-hand page can not be regarded as purely a 
reflection of profitable experience. Still these factors alone might 
be expected to work toward an equal distribution of advertisements 
upon the right and. left-hand pages. The. fact that they are un- 
equally distributed indicates that some selective factor is at work 
in favor of the right-hand page. And the fact that this page is 
used with overwhelming frequency by the purchasers of the ex- 
pensive full page advertisements indicates that the right-hand 


— 


é 


3 
> t 
4 
a 
a I € 
: 
t 
I 
page. ¥ 
4 
a 
id 
| 
4 
j 
| 
| 


12 KITSON—BARNES—IU PPENLATZ 


page has demonstrated its superiority to at least some influential 
and successful advertisers. On slighter evidence than this space- 
sellers have set “preferred-position” prices. 


X. Postrlon ON NEWSPAPER PAGE 


In answer to the question, what is the best portion of a news- 
paper page for an advertisement, recommendations have been 
made as follows: the upper half, the right-hand portion, upper 
left-hand corner, upper right-hand corner. Or, in recognition of 
the need for considering the right and left-hand pages separately, 
the statement has been made that the upper outside corner is 
preferable.’ These conclusions have been reached chiefly from 
laboratory investigations in which the subjects have been shown 
a “dummy” or a real newspaper or magazine containing adver- 
tisements or nonsense syllables distributed among the various por- 
tions of each page, and have been asked later to write down those 
items which they have remembered. The positions of those items 
mentioned the greatest numbers of times have been judged to be 
the most favorable. It should-be noted that the conclusions re- 
garding favorability of positions are for the most part based upon 
studies made with magazines. There is considerable doubt 
whether conclusions reached regarding positions in magazines are 
true also for newspapers. 


Regardless of the advantageousness of these particular positions 
on a newspaper page it may be seen from even a cursory exami- 
nation of the best type of newspaper that they are not equally 
open to the use of advertising matter. For there is a practice of 
newspaper make-up that crowds advertisements into certain por- 
tions of the page in order to reserve certain other portions for 
news. A circumstance which presents grave obstacles to the use 
of the. positions which have been so laboriously designated as ad- 
vantageous. 


Specifically described, the practice of newspaper make-up, tech- 
nically called “pyramiding,” is to block up the advertisements first 
from the lower right-hand corner of the page. On some pages 
(usually the second page of the newspaper) only a small portion 
of this corner is used. But as an increasingly larger portion of 
the page (towards the middle and latter half of the paper) is 
consigned to advertisements the advertisements begin to run up- 
ward and leftward until often only a small portion of the upper 
left-hand corner is left for news. But this corner is usually thus 


1Adams, op. cit. p. 101. 
Hotchkiss and Franken, op. cit. pp. 23-25. 
eet Advertising. Scott Foresman and Co. 1914, Chicago, pp. 
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yeserved (unless the entire page is devoted to advertisements). 
Here is an apparent block to effective newspaper advertising, at 
least to that theoretically recommended by laboratory investig- 


ators. 


In order to measure this block and express it in quantitative 


terms the 
practice. 


writers undertook an investigation of ‘actual journalistic 
Choosing a newspaper that admittedly represents a high 


standard of journalistic make-up and advertising value, they tab- 
ulated the number of times the different quarters of pages were 
occupied by advertisements, and from this calculated the chances 
that an advertiser might secure a “favored” place for his adver- 


tisement. 


The New York Times was used, and 1000 successive pages bear- 
ing advertisements and news were chosen from the issue of May, 
June and July, 1922. Those pages were omitted which contained 
nothing but advertisements, also those containing classified adver- 


tisements. 


the particular portion (half or quarter) contained any advertis- 


Notation was made in each case as to whether or not 


ing matter, be it ever so small. Results are shown in Table I, 
and Figure 1. 


SHOWING 


RIGHT 


TABLE I. 


THE NUMBER AND PER CENT OF UPPER AND LOWER, 
AND LEFT PORTIONS OF LEFT AND RIGHT HAND 


PAGES OCCUPIED BY ADVERTISEMENTS IN THE 


505 


Lower Half 


Left 


NEW YORK TIMES DURING MAY, JUNE 
AND JULY, 1922. 
Left Hand Pages 495 Right Hand Pages 


Upper Half Lower Half Upper Half 
Right Left Right Left Right Left Right 


Quarter Quarter Quarter Quarter Quarter Quarter Quarter Quarter 


No. % 


17535 


Left hand page 


No. % No. % No. % No. % No.% % ‘ON % ‘ON 
300 77 150 30 370 73 115 23 335 #68 115 2 295 60 


Right hand page 


73 23 /, 60 


35 | 


Fig. 1. 


77 23 68 


Showing the per cent of pages on which the various 


quarters were occupied by advertising matter. 
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1. Left and Right-hand Pages. 

Of the one thousand pages examined, 505 were on the left-hand 
page and 495 on the right-hand page. (It is interesting to note in 
this connection that full-page advertisements, which had been tab- 
ulated incidentally but not counted among the thousand shown in 
the table, appeared more frequently on the right-hand page. Of a 
total of 235 full-page advertisements, 120 or 53 per cent were on 
the right-hand page.) 

2. Lower half. 

a. Left-hand page. On the 505 left-hand pages the lower left 
quarter was occupied by advertisements in 175 or 35 per cent of 
the cases; the lower right quarter, in 390 or 77 per cent of the 
cases. 

b. Right-hand page. On the 495 right-hand pages the lower 
left quarter was occupied by advertisements in 115 or 23 per cent 
of the cases; the lower right quarter, in 335 or 68 per cent of th 
cases. 

3. Upper halj. 

a. Left-hand page. On the 505 left-hand pages the upper left 
quarter was occupied by advertisements in 115 or 23 per cent of 
the cases; the upper right quarter, in 295 or 60 per cent of th: 
cases. 

b. Right-hand page. On the 495 right-hand pages the upper 
left quarter was occupied by advertisements in 155 or 23 per cent 
of the cases; the upper right quarter, in 295 or 60 per cent of 
the cases. 

CONCLUSIONS. 

These results show that regardless of the alleged superiority of 
certain positions in newspapers, there are pronounced limitations 
to the likelihood that these positions may be secured. Thus 4gl- 
though the right-hand page may be more desirable, it will be open 
for advertisers no more frequently than the left, since these results 
show it to be occupied only 495 times in 1000. > 

The upper outside quarter, which is recommended as being most 
valuable of all can be secured only 30 times in 100 on the left-hand 
page, though, fortunately it can be secured 60 times in 100 on the 
right-hand page. (For the sake of easier visualization, the fre- 
quency with which the various quarters of left and right-hand 
pages are used is shown diagramatically in Figure 1.) 

The right-hand side of both pages is twice and even three times 
as likely to carry advertisements as the left-hand side. Which 
means that in the case of the left-hand page the majority of ad- 
vertisements are going to be on the inside. And, unfortunately 
for the advertisers, this tendency is more marked in the case of 
the left-hand page than in the case of the right. These amoants 
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may not hold good for all newspapers, especially for papers using 
the pyramid system more thoroughly than the New York Times. 
With such papers it is likely that the percentage of pages carrying 
advertisements in the upper left-hand corner would be still smaller. 
From a journalistic point of view this quantitative investigation 
of the practice of pyramiding shows that the pyramiding pro- 
gresses much more rapidly upward than leftward, the upper right 
portion of the page being occupied almost as frequently as the 
lower right portion. Though it should be remarked that the ad- 
vertisements did not usually occupy the entire upper right 
ter; tabulation was made if an advertisement m 
into the area. 


quar- 
erely extended 


° 
While this investigation furnishes no light concerning the rela- 
tive values of the various positions on the newspaper page, it 
hows that there are distinct limitations to the ease with which 
oh in positions can be secured, and it shows, at least for 
ae grade newspaper, the size of the limitations in quantitative 
terms. It also suggests that the findings of laboratory studies re- 
garding many other problems of advertising might profitably be 
checked up with actual conditions in order to show the practica- 
bility of their application in concrete situations. 


one 


a 
and 
fa 
on 
left 
t of a 
4 
the 
4 
wer 
cent q 
lett 
it of 
the 
| 
q 
| 
- 
4 
i 


AN EXPERIMENT IN TESTING ENGINE LATHE 
APTITUDE’ 


By Everett F. PATTEN 
University of Wisconsin. 


The investigation reported in this paper was carried out in 
co-operation with the Shop Practice Department of the College of 
Engineering, University of Wisconsin. It was able to profit ma- 
terially by a preliminary investigation made along the same lines 
in 1921.° It was also favored by the fact that in the Shop Prac- 
tice Course there were many students turning out supposedly 
identical’ pieces of work on fairly similar lathes. There was an 
added advantage in the fact that the men directing the work in 
the shop were university trained, and consequently were able to 
give such aid that the preliminary psychological analysis of the 
job was made more readily than would probably be the case in 
industry. 

The particular type of work investigated was a course in ele- 
mentary lathe practice. This involved four hours of shop work 
every week and two hours of instruction every two weeks. The 
students, following a combination of oral and printed instructions 
together with blueprints, turned out five pieces on an engine lathe 
during the semester. These were: a worm, a worm-wheel blank, a 
bevel-gear blank, a spiral-gear blank, and a planer bolt. The first 
four were turned out of cast iron and the last from rod steel. 

The criterion of this investigation was the amount of skill de- 
veloped in one~ semester of instruction. More specifically, the 
criterion was the quality of lathe work produced by the students 
during the course. Instructors’ estimates are notoriously unre- 
liable, as was found by the preliminary investigation and should 
never be used if an objective criterion can be obtained. Fortun- 
ately in this case the pieces themselves could be measured and a 
thoroughly objective criterion thus found in the accuracy with 
which these pieces correspond to the blueprint specifications. Work- 
ing with great care Mr. Brecky of the Shop Practice Department 
took thirty-three micrometer measurements on the five pieces pro- 
duced by each man measuring the various important dimensions 


1krom the Psychological Laboratory, University of Wisconsin. Grateful 
acknowledgment is hereby made to Prof. Clark L. Hull, Department of 
Psychology, and to Prof. W. L. Dabney, director of Machine Practice. Prof. 
Hull has supervised the investigation and given much helpful criticism, 
especially in the preparation of the manuscript. Prof. Dabney has given 
hearty co-operation and been of great assistance particularly in procuring 
subjects and in perfecting some of the tests. . 

2This investigation was reported in a master’s thesis by D. W. Nelson. It 
is on file in the library, University of Wisconsin. 
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of these pieces. Several thousand very accurate readings were 
thus obtained. In addition the general finish of each piece was 
scored on a percentage basis. 

The measurements obtained were reduced to,percentage grades. 
The deviation of each measurement from the dimension called for 
in the blueprint was first found. A check distribution was then 
made for each of the thirty-three measurements each distribution 
containing the deviations of all the men on that measurement. The 
median of each distribution was assigned the value of 85, the 
average grade given by the Shop Practice Department. The in- 
tervals of the distribution between the median and zero deviations 
were assigned values between 85 and 100 according to the size 
of the deviation. The fourth check from the opposite end of the 
distribution was assigned the value of 70 which is hardly passing 
on the school seale. The intervals of the distribution between this 

oint and the median were assigned values between 70 and 85. 
The last three checks of the distribution were g:ven below 70 or 
failure. Thus the larger the deviation from the standard the 
smaller would be the percentage grade. The mean of the grades 
for each individual piece was found and the average of the mean 
grades of the five pieces belonging to one man was computed in 
order to get his final grade in lathe working ability. Although 
this process took weeks of work the resulting criterion was very 
reliable. Indeed the large coefficient of total correlation (R) ob- 
tained later was probably due in no small degree to the care taken 
in obtaining the criterion. 

In proceeding to find tests which would measure lathe ability it 
was first found necessary to analyze that ability as far as possible. 
This was done by watching the men at work observing particu- 
larly wherein the good ones differed from the poor ones. As a 
result of this analysis eight tests were selected. Some of these 
at first glance would seem to have little connection with lathe 
ability in general, but they were included because they were thought 
to test a constituent factor. In selecting the tests an especial effort 
was made to get tests as dissimilar as possible so that there would 
be little duplication. The detailed analysis will be given with the 
discussion of the various tests. 

(A) While studying the men at work it was noticed that the 
worker must be able to estimate two dimension space and to co- 
ordinate the motions of his two hands. He must also be quick 
and sure of movement. In order to test these abilities the “co-or- 
dination machine” was devised.’ The apparatus is shown in Fig. 


The need for such a machine was first seen by Dr. Hull, who worked out 


its general principles. The practical details of construction were worked 
out by Prof. Dabney. He also designed and made a number of improved 
models. 
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1. It may be best thought of as a compound lathe “rest” attached 


Coordination Machine. Fig. I. 


to a beard. The action of the machine depended on two screws 
placed at right angles to each other. Turning these screws bv 
means of small cranks caused a carriage to move in any desired 
direction in a horizontal plane. A long tracer arm was fastened 
firmly to the carriage. Projecting down from the middle of the 
arm was a small brass point. This touched the surface of a hard 
rubber plate which was fastened to the board near the main part 
of the machine. Embedded in the plate so that their tops were 
flush with the surface were six brass pegs arranged in a three- 
inch hexagon. These pegs were connected by an electric wire so 
that when the tracer point touched one of them a buzzer sounded. 
The object of the test was to start on signal from a certain point 
and make the circuit of the s:x brass spots as quickly as possible. 
The time taken to go around was measured by a stop-watch. A 
correlation of this with the criterion gave a-.423. This was the 
largest and most valuable correlation obtained. 

(B) Another measure was taken of this test, that of distance 
traveled. The tracer arm extended out beyond the contact point 
and had a pencil fastened to its extremity. This traced an exact 
record of the circular path on a piece of paper inserted for the 
purpose. A typical record is shown in Fig. 2. The distance 
traversed by this rather meandering line was measured by a 
specially devised instrument. The frame supporting the works of 
an ordinary watch was filed so as to expose the gear wheel on the 
shaft of the small second hand. The hands of the watch remained 
intact. The frame was so held in the hand of the measurer that 
the teeth of this gear could be run around the pencil tracing, fol- 
low:ng in detail ail its turns. The distance traversed by this wheel 
was shown by the hands of the watch and could be read off at a 
glance. It will be seen that the circular distance was measured in 
units of revolutions of the small hand. One minute (or one revolu- 
tion) was equal in length to 29.5 millimeters. The correlation of 
this measure with the criterion was +.192. This was rather sur- 
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a prising since it meant that the greater the distance, the better 
a the grade on shop work. However, it was included in the final 


team of tests. 


Fig. 2. Typical Tracing from Coordination Machine. 


(C) A third score was also secured by the use of the co-ordi- 
nation apparatus described above. It was observed that the 
worker must frequently cut a true surface in a diagonal direction 
using his eye to guide him. Accordingly the subject was required 
to run the tracer diagonally across between two of the most dis- 
tant pegs going as quickly as possible. Here, as before, the time 
was taken with a stop-watch. The correlation of this measure 
with the criterion gave -.356. This was large but since it was 
thought that it would duplicate the first measure taken, that of 
time on the circular distance, measure C was discarded. 

(D) The second measure of the diagonal co-ordination was 
made in deviations from a straight line between the two pegs. If 
a subject traced a straight line his record would coincide with 
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the diagonal and there would be no deviation. However, if he 
wavered and made wrong moves, his record would deviate con- 
siderably from the straight line distance and would be poor in 
proportion to the amount of deviation. This deviation was meas- 
ured by the area of surface included between the record tracing 
and the true diagonal as shown by a straight line drawn between 
the points. This was measured by means of a planimeter. The 
correlation of this measure and the criterion was —.034. This was 
so small as to be worthless. If the instructions had been to trace 
as straight a line as possible this might easily have proven a val- 
uable test. It seems to have distinct possibilities. 

It will be noted that the co-ordination test described above is a 
form of miniature test. It attempted as is usual in such cases to 
incorporate in a single test as many of the various elements neces- 
sary for success on the job and in as nearly the right proportions, 
as possible. Being a miniature it was portable and easily handled. 
One defect of the test is that it is undoubtedly influenced by train- 
ing and thus is not a pure aptitude test. 

(E&F) Further analysis of the job showed the necessity for 
close co-ordination between hand and eye as well as for quickness 
and accuracy of movement. Accordingly the number checking test 
was introduced. This was the standard Woodworth and Wells 
number checking test in which the subject was started on signal 
and told to cross out all groups of digits containing both 4 and 7. 
He was given one minute. The correlation of the number of right 
groups checked with the criterion was +.055. This was too low 
to be of use so was discarded from the final team of tests. The 
number of groups wrongly checked correlated —.035 so it was also 
discarded. 

(G) It was believed that the possession of mechanical ingen- 
uity would be important for success in lathe working so an effort 
was made to test for this quality. Accordingly an adaptation of 
Stenquist’s lock assembly test was used. The lock was simple and 
inexpensive. Tf onsisted Of Six parts which were: a rectangular 
lower case or plate, a dog, a spring, a bolt, a screw and an upper 
plate. The subject was shown a similar lock which was already 
assembled and told to put the parts of the other lock together 
like it. Although the lock seemed very simple, many had great 


difficulty in assembling it. Four failed entirely giving up after 


working from eleven to twenty-one minutes. The correlation 
with the criterion was —.199. Although not very high (-.33 
was obtained last year) it had such a low correlation with the 
other tests that it was kept in the final team as tapping a real 
constituent of the particular aptitude under consideration. 
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(H) The ability to follow printed directions and blueprints 
is obviously a factor in learning lathe work. To measure this 
ability and to gain knowledge of the subject’s carefulness and 
painstaking traits, a box-directions test was introduced. This 
test is the well-known Healy puzzle box, consisting of a small 
black cabinet containing a lock mechanism worked by three levers 
at the top and left sides of the box. The object of the test is 
to open this door. The necessary steps in the process were 
printed on a large card bearing an excellent diagram of the box. 
This was before the subject as he worked. This test proved 
even more difficult than the lock test. Ten men did not open the 
box at all, some giving up with the first failure. The correlation 
of the time wita the criterion was -.270. This was large enough 
to warrant its use in the tinal team of tests. 

(1) The number of wrong moves made was another measure 
taken on this test. However, this measure undoubtedly did not 
introduce any new element because errors caused the subject to 
start over again and thus increased his time. In addition the 
correlation was low (—.112) so this measure was eliminated. 

(J) In doing good lathe work a great deal depends on the 
skill of the operator in taking accurate measurements. Accord- 
ingly the cylinder gauging test was introduced. A series of nine 
brass cylinders was very Carefully made under Prof. Dabney’s 
direction. These measured .970’, .965’’, .960’’, .940’", .930”, .9007’, 
.885’7, and .870’. It will be seen that there were three groups 
each composed of three cylinders. Each group or series was grad- 
uated by a different constant the same‘ constant being used 
throughout a single group. The finest series changed by steps 
of .005’’, the next by .010’, and the coarsest by .015’... The sub- 
ject was given a six-inch machinist’s rule, and a pair of calipers 
and told to find the cylinders which measure .965” and .885” as 
quickly as possible. The time taken to find the two cylinders 
constituted one measure. This correlated —.127 with the criterion 
and was too low to be of value. 

(K) Another score on the cylinder test was the amount the 
chosen cylinder deviated in measurement from the one asked for. 
Thus if the subject chose the .930’ cylinder in place of the .965’ 
he was given a deviate score of .035. If he thought the .900” 
cylinder measured .885’’, he scored .015. The sum of these errors 
gave his deviation score, as .050. This measure correlated -.196 
with the criterion. This seemed sufficiently high to be of use but 
it was finally decided to eliminate the test from the final team 
because it was really a trades test measuring acquired skill rather 
than aptitude. 
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(L) An effort was made to test ingenuity and reasoning ability 
by the match-squares test, though at the outset there seemed but 
a gambling chance of success. The match-squares test consisted 
of seventeen matches glued to a board in such a way as to make 
six squares in two banks of three each. The subject was shown 
these squares then given a mimeographed sheet on which short 
lines were drawn to represent the matches in the same position 
as on the board. He was told to cross out five of the short lines 
in such a way that if they were removed there would be left 
exactly three whole squares and no lines left over. The time 
was taken with a stop-watch. When through, the subject was 
asked whether he had ever done this or a similar puzzle before. 
The fact that many were already familiar with it together with 
the additional fact that twenty-two out of the sixty subjects could 
not do it at all seems to have destroyed what little diagnostic value 
the test may have had. The correlation with the criterion was 
only +.164. This was too small and also in the direction con- 
trary to what would have been expected so the test was discarded. 

(M) In watching the men at work it seemed probable that 
a keen sense of space relationship was necessary, particularly 
in guiding the cutting tool to its place and in finding centers. Ac- 
cordingly the circle centering test was devised. A sheet of paper 
bearing a number of mimeographed circles, (see Fig. 3) was put 
before the subject. He was given a pencil and asked to make a 
dot in the center of each circle as quickly and accurately as pos- 
sible. The task was complicated somewhat by the overlapping 
of some of the circles. The time required to place these dots was 
taken by a stop-watch and constituted one measure. This corre- 
lated ~.201 with the criterion and since it was quickly and easily 
taken. Since the test was so cheap and portable it was kept 
in the final team of tests. 

(N) Another measure was taken from this test the errors in 
locating the centers. These deviations from the true centers 
were measured translated into grades by a rather complicated 
process and these grades averaged to get the final grade for each 
man. Since this measure was not used the process will not be 
discussed further. The correlation with the criterion was fairly 
high (-.273) but was in a direction contrary to what was ex- 
pected. Apparently the good subjects sacrificed accuracy to speed. 
Partly because of this, and partly because of the labor in scoring, 
this test was not included in the final team. 

(O) It seemed reasonable to believe that if a man were skillful 
in handling woodworking tools he would also tend to be an adept 
in using other kinds, such as are employed in lathe work. Fol- 
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Fig. 3. Circle Centering Test. 
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lowing this line of reasoning the woodworking test‘ was given. 
The subject was asked to saw a three-Inch square from a small 


board of definite size and shape, being given a saw, a square, a, 


pencil and a special device for holding the wood such as is in 
common use in manual training shops. He was timed by means 
of a stop-watch. This measure had the surprisingly low correla- 
tion of —.119 with the criterion. It was discarded. 

(P&Q) Two other measures were taken from the woodwork- 
ing test one in deviation in dimensions from the three-inch square 
and the other in deviations from the true right angles. These 
latter measures were very difficult to make, requiring hours of 
almost microscopic work with a vernier rule, a square and a 
rule calibrated to one hundredths inch. The correlations in both 
cases were low, being -.095 and +.076 respectively. Both meas- 
ures were discarded. It was a source of some surprise that the 
woodworking test did not have more diagnostic value. 

The instructions for each test were given according to an un- 
varying formula so that the results would be as comparable as 
possible. The tests were given in a quiet room in a definite se- 
quence. When the subject was admitted he went in regular suc- 
cession from one test to the next, showing a card containing his 
name and number to each tester. By such devices a constant 
stream of subjects was kept flowing smoothly through the testing 
room. In order to obtain useful resuits, it was thought desirabie 
to use a minimum of sixty records. Consequently over seventy 
men were tested so as to allow for the loss of 


fective records from various causes. 


ten through de- 


A summary -table of the correlations already given is as fol- 
lows: 


TABLE I 


*(A) Criterion with co-ordination test, time —.4238 
*(B) Criterion with co-ordination test, distance +.192 
(C) Criterion with co-ordination test, diagonal time —.396 
(D) Criterion with co-ordination test, diagonal deviation -.034 
(E) Criterion with number checking, rights +.055 
(F) Criterion with number checkig, wrongs —.035 
*(G) Criterion with lock test, time —.199 
*(H) Criterion with box test, time ~.270 
(1) Criterion with box test, errors ~.112 
(J) Criterion with cylinder gauging, time -—.127 
(K) Criterion with cylinder gauging, deviations -.196 


‘The woodworking test was devised by Mr. F. C. Mueller of the 
Psychology Department, University of Wisconsin. 
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5 
TABLE I—cont. 
a (L) Criterion with match square, time +.164 5 
(M) Criterion with circle centering, time 201 
(N) Criterion with circle centering, grades. 273 
(O) Criterion with woodworking, time 119 
(P) Criterion with woodworking, deviations in dimensions —.095 ; a 
(Q) Criterion with woodworking, deviations in angles O76 
re a The variables finally used were renumbered for convenience: 
se 4 Criterion 1 
of ‘ Co-ordination, distance 2 4 
a Co-ordination, time 
th Lock test, time 4 P 
sox test, time 
he C_rcle centering, time 6 y 
The means and standard deviations of these variables are shown i 
in Table II. 
AS 
e- TABLE II 
c- leans. Standard Deviations. Unit of M urement. 
is M,.—84.046 S.D.,=3.689 Percentage grade on work pieces 
nt Mi 6.9406 S.D.= .2653 Minutes-distance** 
on M== S.D.= .3869 Minute 
le M 5.8585 S.D..=5.089 Minutes 
ty % M 1.78 S.D.;=2.627 Minutes 
e- 0165 S.D. 2547 Minutes f 
om (he zero order correlations and intercorrelations are shown in i 


Table III. 
TABLE IIL. 


+.192 

9 

r -.199 Pa—t.142 r + .056 


+.118 r +-.036 +.103 
») 
as 5 The P. E. of each of the corre‘ations in the first column is 
4 roughly .08. 


ws i it will be noted that although the correlations with the criterion 
; are fairly high, the intercorrelations are exceptionally low. More 
. than usual stress was laid on the latter in this investigation be- 
: cause it was thought that this principle is generally observed too 
little. To get the undiluted correlations with the influence of the 
6 . other tests removed, the zero order correlations were subjected to 


*Indicates the tests which were included in the final team. The above r’'s 
1e 4 were all product-moment coefficients, 
**Distance in terms of circumference cf gears on second hand. 
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partial correlation, using Yule’s formula‘ and Rosanow’s’ general 
technique. 


There were some fifty-two of these partials. Only those 
highest order are given in Table IV: 


TABLE IV 
Partial correlations 


12.3456— 4.194 


13.2456— —.363 

14.2356— —.211 
r 

15.2: 

r 

16.2345=——- —.1650 


These partial correlations show but very little shrinkage and 
show that the experiment was unusually successful securing a set 
of tests highly uncorrelated with each other. 

After the partial correlations were found, the coefficient of total 
correlation was computed. This correlation of the criterion with 
all the tests weighted in the best possible manner, was computed 
in two ways to check the arithmetic.’ The formulae were 


2 
l-r ) 
16.2345 


2 
) (Lr ) 
13.456 12.3456 


One correlation was .5549 and the other .5545 showing all the: 
partials to be correct. This correlation is about as large as seems 
possible to secure from such highly selected groups as university 
students. It demonstrates the possibility of detecting aptitude in 
quality of lathe work produced. 

In order to get the maximum yield from a team of tests as 
well as to use a set of test scores to predict probable success in 


‘Rosanow, op. cit., p. 40. 


fiv 
rr x 
1213 23 
r 
2 
6 l-r 6! l-r 4 
\ 23 
of 
4 
d 
v A b 
t 
2 2 2 2 . 
\ 12 13.2 14.23 15.234 aR: 
2 2 2 : 
_ \ 16 15.6 14.56 
+ 
a given vocation, a regression equation must be obtained. The . 
— ‘Yule, Introductfon to the Theory of Statistics. p. 259. 
*Rosanow, Analysis of Mental Function. p. 230-45. 
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general form’ for this equation for six variables (criterion and 
five tests) is: 
SD sD 
1.23456 1.28456 
_ x r —— xX 
2+ 13.2456 "3 
sD 
3.12456 


1.23456 

r 

4+ 15.2346 
SD 


1.23456 
—— 
6 


6.12345 


Where x.:= the deviation of any subject from the mean of that 
variable (here the criterion), where xX:, X:, Xs, Xs, and xX. equal the 
deviation of any subject from the means of the respective varia- 
bles, and where S.D. sc, ete., equal the partial standard devia- 
tions. These partial standard deviations are obtained by a formula 
of the type: 


1.23456 


15.234 16.2845 
The partial standard deviations are shown in Tabie V: 


TABLE V. 


4.881 
.248 
S.D.s.18e— .343 
Substituting these, together with the partial correlation coeffi- 
cient, in the formula for the regression equation and solving we 
have: 


This, however, is not a very convenient form of the regression 
equation for practical use. Substitutions were accordingly made 


‘Yule, op. cit., p. 244. 
‘Yule, op. cit., p. 237. 


is 
SD SD 
1.23456 
SI 
4 4.12356 5.12346 
SD 
16.2345 
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4 
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for the small x’s so that instead of deviations from means were 
had individual scores minus the mean of that variable 
(Xo-M.». The formula then becomes: 


2.397 (X.-M.) -3.252 (X;3-M;) -.132 (X.-M,) -.853 (X;-M;) 
-1.996 (Xo-M.) 


Substituting values for the various means (found in Table III) 
and solving, the following equation was secured: 


X,=73.635 Xe 


It will be noted that the values X. to X. represent gross scores 
of any given subject on the present team of tests while X, repre- 
sents his probable grade in a course such as previously described. 
Now if a person of unknown aptitude in this kind of work should 
be given the above tests, his test scores might be substituted in 
the above equation and his most probable grade in such a course 
be predicted with some precision. To take an actual case, subject 
number 9 made in this team of tests; the following scores: 

Co-ordination, distance—6.6 revolutions 
Co-ordination, time—1.05 minutes 

Lock test==1.17 minutes 

30x test=—=-3.37 minutes 

Cirele centering, time=—.35 minutes 


Substituting these in the equation, the result is: 


X,=73.635—2.397 (6.6) -3.252 (1.05) -.132 (1.17) -1.996 (.35) 


Solving, X,—84.008. This man actually scored 84 in his five 
pleces. 


To lighten the work of prediction, both for this investigation 
and for the one to be carried on with a new group of subjects 
here next semester, a chart was prepared containing all the 
products of the probable test scores and their respective weights. 
Lie scores were then predicted for all the sixty subjects, both to 
confirm the R and to observe the detailed relation of individual 
predicted scores to the actual scores. This is shown in detail in 
the scatter diagram, Fig. 4. 
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FIGURE 4. 


Scatter Diagram. 
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RACIAL DIFFERENCES AS MEASURED BY THE 
DOWNEY WILL-TEMPERAMENT TEST 


By JOHN H. McFapps=n and J. F. DASHIZLL 


INTRODUCTION 


Study of character and personality traits is, by its very nature, 
a perplexing problem, since there are a multiplicity of factors 
entering into the make-up of any given individual, and since the 
organization of these factors varies so greatly with each individual. 
Several tentative classifications of the traits and of the factors 
governing the combinations of the traits have been made. Wood- 
worth' analyzes personality into four factors: Physique, Chemique 
or Temperament, Instincts, and Intelligence. F. H. and G. W. 
Allport® div:de tendencies into: Intelligence; Temperament, includ- 
ing Emotional Breadth and Emotional Strength; Se!f-expansion, 
including Extro-Introversion, Ascendance-Submission, Expansion- 
Reclusion, Compensation, and Insight and Self-Evaluation; and 
Sociality, including Social Participation, Self-seeking and 
Aggressive Self-seeking, and Susceptibility to Social Stimuli. 
Tests for these were given, and the results checked by estimates 
of character and personality given by friends of the subjects 
tested. The above division of tendencies is considered by the 
writers as merely a tentative one which may be of some use in 
guiding further character analysis. Watson’ gives a form for 
analysis of personality, and Wells’ classifies the healthy mental 
reactions as distinguished from unhealthy ones. H. T. Moore and 
A. R. Gilliland’ studied aggressiveness as a personality trait, iso- 
lating it as far as possible by selecting two groups of students, one 
composed of students rated as the most aggressive in their class, 
and the other composed of students rated as the least aggressive 
in their class. The following tests are suggested as measurements 
of aggressiveness: eye movements; distraction from staring, from 
electrical shock, and from visual presentation of a snake; and a 


Woodworth, R. a Study of Mental Life. Henry Holt 
and Co., New York, 1921. 
= 2 Allport, F. H. and Allport, G, W. Personality Traits: Their Claasi- 
fication and Measurement. Journal of Abnormal Psychology and Social Psy- 
chology. Vol. 16, No. 3. April, 1921. 

® Watson, J. B. Psychology from the Standpoint of a Behaviorist. 
J. B. Lippincott Co., Philadelphia. 

* Wells, F. L. The Systematic Observation of the Personality—In [ta 
Relation to the Hygiene of the Mind. Psychological Review, Vol, 21, 1914, 
pp 295-332. 

a § Moore, H. T. and Gilliland, A. R. The Measurement of Aggressiveness. 
7? Journal of Applied Psychology. Vol. 3, No. 2, June, 1921. 
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word association test. In the above tests, the first group scored 
high, and the second group scored less, thus indicating the value 
of the methods used. R. O. Filter’ studied Self-Assurance and 
Speed of Decision, concluding that one test was not sufficient for 
each trait, but that two or more should be given. Also, he con- 
cludes that there was not a very close correlation between speed 
of decision and self-assurance. The test used in the present 
study of racial differences is the Downey Will-Temperament Test’ 
formulated by Dr. J. E. Downey, of the University of Wyoming. 
The particular problem here studied is the compar'scn of, tem- 
perament as found in individuals of the white and negro races. 
Previous studies have been made regarding the comparative in- 
telligence of the two races. Mayo’ finds that, while the “data 
points clearly to a measurable degree of mental difference,” the 
differences are of degree, and not of kind, and that mental varia- 
bility is sl’ghtly less in negro high school students than in white 
high school students. Ferguson’, using the Woodworth and Wells 
Mixed Relations Tests I and II, a form of the Ebbinghaus Com- 
pleticn Test, a Cancellation Test, and one of the Columbia Maze 
Tests, finds the intellectual performance of the general colored 
population to be approximately seventy-five per cent as efficient 
as that of whites, with the possibility of negroes more nearly 
approaching whites in reasoning based upon concrete objects than 
in reasoning based upon mental representations of the objects. 
He finds a practical equality in sense capacity in the perceptive 
and discriminative abilities, and, since the difference must exist in 
the central elaborative powers upon which thought more directly 
depends, he infers that such neural differences as may be found 
will probably be mainly in the constitution of the cortical neurons, 
rather than elsewhere in the nervous system. This point will be 
taken up later. Ferguson also states that, while no certain con- 
clusion as to racial variability can be reached, it would be reason- 
able to suppose that the negroes would vary more than whites, 
since the so-called negro race is not nearly as homogeneous as the 


® Filter, R. O. An Experimental Study of Character Traits. Journal 
of Applied Psychology. Vol. 5. No. 4, December, 1921. 

* Downey, J. E. The Will-Profile, A Tentative Scale for Measurement 
of the Volitional Pattern. Univ. of Wyoming Bulletin, Vol. 15, No. GA, Feb., 
910. 

The Adolescent Will-Profile. Journal of Educational Psychology. Vol 
11, No. 3, March, 1920. 

Some Volitional Patterns Rerealel by the Will-Profile. Journal of 


Experimental Psychology, Vol. 3, No. 4, August, 1920. 


Manual of Directions for Testing. World Book Co., Yonkers-on-Hudson, 
N. 

*‘ Mayo, M. J. The Mental Capacity of the Negro. Archives of 
of Psychology. Monograph No. 28, 1913. 

* Ferguson, G. O., Jr. The Psychology of the Negro Archives of 
I’sychology, Monograph No. 36, April, 1916. 
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white race. Derrick” finds that the negro college student is older 
than the white, that their average intelligence quotient is nine 
points lower than the average white intelligence quotient, and that 
the white college student is about seventy-seven per cent as vari- 
able as the negro college student. 


Turning from previous work on intelligence, and considering 
other studies of racial differences, we find several reports on dif- 
ferent traits. Stetson" finds practically no difference in memory 
in the two races, though there may be some doubt as to the value 
of this contribution. Bache” finds the negro quicker in his reac- 
tion time than the white man, and Garth” finds that there are 
slight differences in their work curves—that there is a tendency 
of the negros to do more of their work in the first half of the 
allotted time than: for the white workers. This would indicate 
that the negroes are more quickly aroused to work, but that, after 
the first spurt, their effort diminishes. Mitchell, Rosanoff and 
Rosanoft" in a study of association, find that negro children, on 
the whole, show further departure than white ones from the normal 
adult association standard. 

Before going on into our particular problem, it is necessary to 
note some cautions which experimental work has given. Pressey” 
warns that grade norms may vary very greatly in various schools, 
that there may be different age distribution, and that other factors 
may enter in. And, in a comparison of students of different races, 
it is all the more important to bear this in mind, since grade norms 
will inevitably vary much more greatly in schools for different 
races than in different schools for students of the same race. 
Arlitt’® finds a tendency for greater variation (in results from 
intelligence tests) in the same race but different social status than 
in the same social status but in different races. This latter point 
will be considered in the next section. Further, as Gatewood” 
points out, there is a tendency to rate negro children above their 
mental status, as compared with the tendency to rate whites at or 
below their true mental status. 


© Derrick, S. M. A Study of the Intelligence of White and Colored 


College Students. Journal of Appi 
cember, 1920. 

" Stetson, C. R. Some Memory Tests of Whites and Blacks. Psychol- 
ogical Keview, Vol. 4, pp. 285-289, 1897. 

2 Bache, R. M. s#eaction Times With Reference to Race. Psychological 
Review, Vol. 2, pp. 475-486, 1895. 

3) 6©Garth, T. R. White, Indian and Negro Work Curves. Journal of 
Applied Psychology, Vol. 5, No. 1, Mareh, 1921. 

“4 Mitchell, 1., Rosanoff, I. R., and Rosanoff, A. J. A Study of Associ- 
ation in Negro Children. Vsychological Review, Vol. 26, September, 1919. 

® Pressey, S. L. The Problem of the Unselected Group in the Stand- 
ardization of Tests, Journal of Applied Psychology. Vol. 5, No. 1, March, 1921. 

6 Arlitt, A. H. On the Need for Caution in Establishing Race Norms. 
Journal of Applied Psychology, Vol. 5, No. 1, March, 1921. 

7 Gatewood, E, L. Teachers’ Estimates of Negroes and Whites. School 
and Society, Vol. 9, pp. 90-91, 1919. 


ed Psychology. Vol. 4, pp. 316-329, De- 


RACIAL DIFFERENCES 


SUBJECTS AND TESTS 


The subjects tested were white and negro high school and college 
students. It was hoped, by choosing such groups, to eliminate 
many of the factors of selection, and to make the groups more 
directly comparable, and also to fix a general level of intelligence 
between the groups. It was found, however, that, in general, 
negro students of each grade were slightly older than white stu- 
dents of the same grade. This is constantly found in comparisons 
of white and negro students. Graphs and tables follow, which 
endeavor to analyze the general results, and to interpret them in 
terms of equal ages and equal grades. These results are based 
upon the records of 38 high school and 39 college students from 
each race. 


The thirty-eight white high school students were tested at the 
Chapel Hill High School, and were chosen at random, nearly all 
of the male students being taken. A note should be made here 
concerning the make-up of this high school, since it is a little out 
of the ordinary. The high school students are drawn from three 
fairly distinct social classes: those from the surrounding farm 
and mill districts, those whose parents are in business in the town 
of Chapel Hill, and those whose parents are members of the 
Faculty of the University of North Carolina. A survey of gen- 
eral intelligence of these students with regard to their respective 
social classes was made in 1917 by Chase and Carpenter and again 
in 1921 by Dashiell and Glenn, who found that the social levels 
and intelligence levels were strongly related. As might be ex- 
pected, the Faculty children led in general intelligence, followed 
by the Town children, with the Out-of-Town group last. Chapel 
Hill High School embraces the 8th, 9th, 10th, and 11th grades, 
with seven years of grammar school work preparatory. 


The thirty-nine white college students were taken from classes 
in psychology at the University, the entire class in Abnormal 
Psychology being tested, and the balance volunteering from intro- 
ductory classes. Except for the very few freshmen and seniors, it 
is a fairly typical group of college students. 


For the negro subjects, students in three negro institutions were 
tested. The National Training School, at Durham, N. C., fur- 
nished five college students and all except two of the high school 
students. There is some doubt as to the validity of all the results 
from this group, since the testing was stopped, after forty sub- 
jects had been secured, upon evidence that some of the students 
were being coached by their fellows. This would naturally affect 
the Reaction to Contradiction test, and it seemed to affect the 
Motor Inhibition and Co-ordination of Impulses tests, since there 
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was an unnatural tendency for the subjects to write excessively 
slowly in the preliminary warming-up (Test II, 3), and, for the 
Co-ordination of Impulses, to take great pains in adhering to the 
space limitations, but to fail in the time limitations. These 
tendencies were not noted with other groups, either of negroes or 
of whites. The doubt concerning coaching, and the fact that these 
tests and the norms for grading them were developed primarily 
for adults, would render the high school comparisons somewhat 
inconclusive, and would emphasize the results from the college 
groups. 

The second group of negro students was from Shaw University, 
in Raleigh, N. C. The men tested were selected by the President 
of that University. Shaw University is one of the best negro insti- 
tutions in the South in regard to college ranking. Eighteen men 
were tested there, during a period of three days. The writer does 
not believe that any of the men in this group were coached. 

The last group of negro students was from the Negro Agri- 
cultural and Technical School, at Greensboro, N. C. This is a 
state institution, but, owing to the fact that agricultural schools 
usually have lower standards of admission than the Colleges of 
Liberal Arts in the various Universities, it is possible that a 
comparison of the men in this group with men from the Univer- 
sity would be unfair. Still, the three negro schools visited are 
typical of the institutions of that race, and the men afford a 
better basis of comparison, being more widely distributed, than 
if all the subjects were taken from one institution. At the 
Agricultural and Technical School, eighteen men were tested dur- 
ing a period of.three days, sixteen of the men being of college 
grade, and two of high school grade. 

Although precautions were taken to standardize the subjects 
in regard to education, it must be admitted that the criticism of 
Arlitt’ regarding differences in social status holds good here 
Derrick” reports that “White boys from average environment go 
to college nearly twice as frequently as negro boys from average 
negro environment”; thus, the negro college students are more 
highly selected from their environment than the white students from 
theirs. The group of negro agricultural college students furnished 
a partial check on this condition, but for the best study in race 
comparison, it would be necessary to select a large number of in- 
dividuals from all environments. The present comparison can 
claim to hold good only for college and high school students of the 
two races. 


% Arlitt, A. H. On the Need for Caution in Establishing Race Norma. 
Journal of Applied Psychology, Vol. 5, No. 1, March, 1921. 

” Derrick, 8S. M. A Study of the Intelligence of White and Colored 
College ——o- Journal of Applied Psychology, Vol. 4, pp. 316-329, De- 
cember, 1920, 
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To better insure uniformity in results, all of the tests here 
reported were given by one of the writers and were graded by him. 
The grades were checked by the same writer and by a student 
trained in the method of grading. P 

It is not thought necessary to give an exhaustive description 
of the Will-Temperament Test, but definitions of the traits scored, 
and a statement of the value of the scores in each, are necessary 
for a full understanding of the results. In the order in which 
they are listed by Prof. Downey”, they are: 

Speed of Movement: Speed of movement of person relative to 
size and age. A score of “10” here indicates a very high speed, 
grading down to “1”, which indicates very slow movement. 

Freedom from Load or Inertia: Tendency to work at one’s 
highest speed without external pressure, little tendency to relax 
speed, quickness in warming up to a task. A score of “10” indi- 
cates practically uniform speed, “1” indicates great increase, which 
may mean that the subject usually works more slowly than he 
could—that he has excessive “load” or weight of inertia. 

Flexibility: Ease and success in readjustment; capacity to 
modify one’s routine reactions. A score of “10” here indicates 
highest success, “1,” practically failure. 

Speed of Decision: Quickness in reaching a decision or con- 
clusion. A slow reaction here may be due to caution or con- 
servatism in weighing the elements involved in a situation, or be 
caused by one’s being side-tracked by irrelevant matters or a 
rambling procedure. Here “10” indicates a speedy decision, “1”, a 
very slow decision. 


Motor Impuision: This trait refers both to impetuosity and 
energy of reaction. Consider the ease with which the brakes or 
inhibitions can be removed, and also the tendency to explosive re- 
action when the brakes are actually off. A “10” here indicates 
explosive or expansive reaction under distraction, grading down 
to a “1,” which indicates inhibition or withdrawal into one’s self. 

Reaction to Contradiction: This refers to the degree of con- 
fidence with which one maintains his opinion against contradic- 
tion. A “9” or “10” reaction signifies an aggressive subject—the 
burden of proof is thrown on the person who does the contradict- 
ing; “7” and “8” are confident reactions, but reasons are cited for 
one’s confidence and the burden of proof is accepted; “5” and “6” 
are tactful reactions; below “5” there is a grading down to com- 
plete failure to assert one’s own opinion. 

Resistance to Opposition: The vigor with which one reacts im- 
mediately to a blocking of one’s purpose. It grades from “10”, a 


» Downey, J. E. Manual of Directions for Testing. World Book Co., 
Yonkers-on-Hudson, N. Y., 1921. 
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strenuous reaction, to “1”, complete passivity in the face of 
opposition. 

Finality of Judgment: Confidence in previous decision, willing- 
ness or unwillingness to reopen a question once closed. A “10” 
indicates refusal to reconsider, or a very rapid reconsideration, a 
“1” indicates extremely slow reconsideration. Sometimes the sub- 
ject spends as much time on review as he does on the original 
decision. The score in this trait should be interpreted while bear- 
ing in mind the score or time spent on the original decision. 

Motor Inhibition: Capacity to keep in mind a set purpose and 
achieve it slowly. It involves power of motor control, imperturba- 
bility, and patience. A score “10” indicates highest success, a 
“1” indicates practically no inhibition. 

Interest in Detail: Attention to details. This trait is not equiv- 
alent to accuracy, but rather it indicates thoroughness, which may 
or may not be accompanied by analytical thought or insight. 
A “10” indicates highest care for details, a “1” shows very 
little care. 

Co-ordination of Impulses: Capacity to execute a double task 
without preliminary trial, capacity to handle a complex situation 
successfully without forgetting either factor involved. This trait 
is probably allied to keeping one’s head in a confusing situation. 
A “10” here indicates greatest success, a “1” indicates practically 
failure. 

Volitional Perseveration: Absorption in a task; willingness 
to keep “plugging away” at it because the examinee sets up a 
goal for himself. A “10” here is also the most desirable score, 
while a “1” indicates little desire to take pains jin perfecting a 
piece of work that the individual takes personal pride in. 

The scores in the Downey tests were obtained by taking the 
records of a large number of individuals, for some given trait, 
and assigning score values in such a way that 10 per cent of 
the persons tested attained each score. Thus, in a given group, 
we could hardly expect to find normal distribution curves for the 
various traits, but rather a tendency for an equal number to 
attain each score. Further, the total score made on the test is 
not as significant as the grouping of the scores in the various 
traits, although it has been considered that a high total score 
indicates a “strong” personality, and a low total score indicates 
a “weak” personality. Also, the tests and grading norms were 
devised for adults, and this should be taken into consideration 
when these tests are given to younger subjects. 

G. M. Rush” gives a criticism of the Will-Temperament 


= Qos, G. M A Preliminary Study of the Correlations Between 
Estimate of Volitional Traits and the Results from the Downey “ Will-Profile.’” 
Journal of Applied Psychology, Vol. 5, No. 2, June, 1921. 
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Test, in regard to correlation of the results and estimates. He 
finds, for fifteen subjects, very little correlation between the 
estimates of the traits and the scores in those traits as tested 
by the Will-Temperament Test. Prof. Downey”, however, 
using a slightly different method of estimates, found sufficient 
correlation to justify the tests. 


RESULTS AND INTERPRETATIONS 
Table I contains the summary of means and medians of the 
scores in the various traits for the two races, divided into college, 
high school, and combined groups, with the standard deviation 
from the mean in the various traits for the two college groups. 


TABLE I. 
College Students High School Combined 
WHITE STUDENTS 
Trait Mean S. DJ Median Mean Median Mean Median 
Speed of Movement ........ 6.77. 2.5381 7 5.46 5 


Freedom from Load ........5.41 2.527 6.71 7.5 6.06 6 
4.13 4 4.65 5 
Speed of Decision ............ 4.10 2.888 3.21 3 3.66 3 
Motor Impulsion .............. 4.51 1.781 3.76 4 4.14 
Reaction to Contradiction 5.79 2.451 5.50 6 5.65 
Resistance to Opposition 5.77 2.731 5.50 7 5.63 
Finality of Judgment ......4.92 2.392 5.60 5 
Motor Inhibition .............. 5.41 3.409 4.60 5 
Interest in Detail .............. 7.33 1.489 6.44 
Co-ordination of 
Volitional Perseveration..6.64 2.956 
NEGRO STUDE 
Trait Mean 8. D. Me 
Speed of Movement .......... 4.10 3.070 


Freedom from Load ........ 5.48 2.716 
Speed Decision.............. 87 3.048 
Motor Impulsion ..............5.15 1.955 
Reaction to Contradiction 6.10 1.997 
Resistance to Opposition..5.33 2.641 
Finality of Judgment ...... 4.77 2.301 
Motor Inhibition ............. 3.84 3.519 
Interest in Detail ............. 6.82 2.146 
Co-ordination of 
$887 2.8388 3 2.13 
Volitional Perseveration...5.69 2.524 6 8536 7 7 


or ore 


3.05 
5.81 
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5.87 
4.23 
4.71 
5.47 
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Downey, J. E. The Wiil-Profile, A Tentative Scale for Measurement 
of Fes Volitional Pattern. Univ. of Wyoming Bulletin. Vol. 15, No. 6A, Feb. 
919. 
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Attention must be called to the fact that a composite profile 
plotted from mean or median scores, as given in Figure 1 and 


FIGURE 1. Mean Scores. 
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Flexibility VIII 
Speed of Decision 1 


Resistance to Opposi 
Finality of Judgment XIII 
Motor Inhibition VII 
Interest in Detail IX 
Codrdination of Impulses V 
Velitional tion VIII2 


a. College students b. High school students 

aM 

Freedom from Load II-1, 2; VI-1,2 
Flexibility VIII 
Speed of Decision I 


Resistance to Opposition XII 
Fiaality of Judgment XIII 
Motor Inhibition VII 
Interest in Detail IX 
Cosrdination of Impulses V 
Welitiooal Perseveration VIII-2 
Combined 


Figure 2, must be read with caution, since there is so great pos- 
sibility of individual variability in the scores in each trait. Also, 
these profiles do not indicate, more than in a very general way, 
tendencies toward racial patterns. However, from Table I, and 
Figures 1 and 2, we may say in general that the “average” 
white student has greater speed of movement, that he has slightly 
greater freedom from load—that is, he tends to work on the 
weighing of the evidence, or less decisiveness; that he is slightly 
average more nearly at his greatest speed; that he has slightly 
greater flexibility and ability to meet new conditions in life; that he 
is slightly slower in making decisions—which may mean a better 
less liable to become explosive under distraction from conflicting 
stimuli; that he is less forceful when contradicted, which may mean 
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a greater degree of social adjustment (a score:of “5” is the tact- 
ful response), or a greater degree of suggestibility; that he is 
firmer when confronted by obstacles or physical opposition; that 
he takes about the same time, or slightly longer in reconsidering 


FIGURE 2. Median Scores. 
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Resi to Opposition XII 
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Motor Inhibition VII 
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¢@. Combined 


his decisions, which may be due to greater care, or to greater 
indecision; that he has, as a college student, greater ability or 
desire to restrain his impulses, and, as a high school student, less 
ability or desire (but here must be remembered the criticism given 
above, that there was some evidence of coaching upon the results 
from negro high school students) ; that he has practically the same 
interest in detail; that he has a markedly greater ability to co- 
ordinate his impulses, or to meet confusing situation; that he 
has a slightly greater tendency to volitional perseveration 

Figure 3 shows the distribution of the individual scores in each 
trait, for the high school students, the college students, and the 
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combined groups, 6f both races. Score values are indicated on 
the horizontal axis, number of individuals on the vertical. 

The median profile of the high school students (Figure 2b) 
shows emphasis on the traits which combine to suggest an azgres- 
sive personality, a result which was found by Dr. Downey” 
in testing high school freshmen. 

A point may here be brought out in relation.to the negro’s 
tendency to resist contradiction more strongly than the white. 
Dr. Bevis", tabulating the frequencies of various forms of 
psychoses among the Southern negroes, reports that dementia 
praecox stands at the head of the list of their psychoses, with the 
catatonic form occurring about twice as often as in the white. 
Since one of the characteristic symptoms of the catatonic form 
of dementia preacox is negativism, the result of this test checks 
fairly with Dr. Bevis’ report. Also, (and cf. No. 6 in “Summary”) 
we might expect a greater degree of aggressiveness from the 
negro when he is allowed to reveal his inner self. As a race, 
they are generally oppressed or held down, a fact which might 
lead us to look for a reaction in the direction to greater aggres- 
siveness when unchecked. Observations of Northern negroes have 
been reported, where the negro, freed from race restraints, has 
swung to the other extreme and has been rather too aggressive; 
and the tendency of the negro drivers to beat their mules more 
mercilessly than do white drivers, is a matter of observation. 
This is apparently a case of “inferiority complex” and “com- 
pensation”. 

The lower score of the white students in motor impulsion might 
have been expected. Odum” says, “The two general character- 
istics, then, the negro’s emotional nature are the lack of restraint 
and the consequent extreme expression of the feelings.” These 
would naturally be allied with greater motor impulsion. 

Various investigators have made different reports regarding 
the variability of the negro as compared with the white. Fergu- 
son” says that no certain conclusion as to racial variability can 
be reached, adding “It may finally be remarked with regard to 
the relative variability of whites and negroes, that it would not 
be at all surprising if groups of so-called negroes were definitely 
shown to be more variable than comparable groups of whites. For 
the groups that are generally called negroes are composed of in- 


* Downey, J. E. The Adolescent Wiil-Profile. Journal of Educational 
Psychology, Vol. 11, No. 3, March, 1920. 
. * Bevis, W. M. Psychological Traits of the Southern Negro. 
Jounal of Psychiatry, Vol. 11, No. 1, July, 1921. 

* Odum, H. W. Social and Mental Traits of the Negro. Columbia 
University Studies in History, Economics and Public Law, Vol. 37, No. 3, 
Whole No. 99, 1910. 


** Ferguson, G. ©O., Jr. The Psychology of the Negro. Archives of 
Psychology, Monograph No. 36, April, 1916. 
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FIGURE 3. 
Distribution of Individual Scores in the Different Traits 
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FIGURE 3. (cont.) 
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(cont. 


FIGURE 3. 
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dividuals ranging from pure negroes to persons almost white, 
and it would be reasonable to suppose that such groups would 
vary more than would homogeneous white groups.” Mayo” states 
that mental variability is less in negro high school students than 
in white high school students, the ratio being 98 to 100. Derrick’ 


TABLE II. 


Comparison of the Standard Deviations of the Scores Made by the 
White and Negro College Students in the Various Traits 
White Negro White Per Negro Per 
Trait S.D. S. D. Excess Cent Excess Cent 
Speed of Movement ..................2.531" 3.070 539 21.3 
Freedom from Load ................. 2:587 2.716 ate 
.740 2.000 14.9 
Speed of Decision 2.888 3.045 157 
Motor’ Emipulsion. 1.781 1.955 174 9.7 
Reaction to Contradiction ......2.451 1.997 .454 22.2 
Resistance to Opposition .........2.731 2.641 .090 3.4 
Finality of Judgment ..............2.8392 2.301 .091 3.8 
Motor Inhibition 3.409 3.519 16. 
Enterest in Detail 1.489 2.146 657 44.1 
Co-ordination of Impulses ...... 2.882 2.838 .044 1.5 
Volitional Perseveration ......... 2.965 2.524 .441 18.2 
Average Per Centage 


bo bo bo 


finds the opposite in regard to college students, when compared 
" in respect to intelligence. He says “The white college student was 
about 77 per cent as variable as the negro.” Nor is anything 
definite concerning variability found in the results from the Will- 
fa Temperament Tests. Inspection of the distribution curves for 
, : the scores in the various tests (Figure 3) shows practically no 
| \. preponderance of either race. Comparison of the standard devia- 


\tions from the mean in the scores made by college students, 

ble II, shows that the negro deviations exceed those of the 

whites in seven of the traits, and that the amount of excess is 

V relatively greater. There would, then, seem to be a slight but 
not striking tendency for the negro to be more variable. 

After the comparisons giyen above were made, the results were 

analyzed. in terms of ages and academic grades. Except for the 

: preponderance of freshmen in the white high school group, and 


** Mayo, M. J. The Mental Capacity of the American Negro. Archives 
of Psychology, Monograph No. 28, 1913. 
* Derrick, &. M. A Study of the Intelligence of White and Colored 
i College Students. Journal of Applied Psychology, Vol. 4, pp. 316-329, De- 
cember, 1920. 
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TABLE III. 


Distribution of subjects by ages and grades. 
WHITE STUDENTS 
High School College 
Fresh. Soph. Jun. Sen. Fresh. Soph. Jun. 
1 


1 
4 4d 4 23 
NEGRO STUDENTS 
High School College 
Fresh. Soph. Jun. Sen. Fresh. Soph. Jun. 


i2 


45 
Age Sen. ae 
10 ~ 
11 
12 
| 13 5 or 
14 2 1 1 : 
15 5 1 1 4 
16 3 1 3d 
17 1 1 2 1 
: 18 2 1 1 1 2 5 + 
19 1 1 6 3 1 - 
20 1 1 5 3 : 
| 21 2 1 3 4 Pang 
22 1 2 
23 1 1 
24 1 
25 
26 
Age Sen. ack 
12 1 
13 
14 2 
15 1 
16 1 
17 2 1 Pht 
18 3 5 1 1 1 ie 
19 2 2 1 1 1 2 shane 
20 1 2 1 3 2 ae 
21 1 1 2 2 1 aa 
22 2 1 1 Beit 
23 1 1 3 1 1 2 ne 
24 2 1 1 2 2 | es 
25 1 1 1 sel 
26 1 1 2 ae 
28 1 
Total 9 8 11 9 7 12 
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TABLE IV. 


scores by ages of subjects. 

WHITE STUDENTS 

SD Imp. RC RO FJ Inh. ID CI VP 

3.38 3.90 5.81 4.81 5.66 5.76 6.76 4.38 6.28 

4.20 4.79 5.00 6.14 5.38 5.24 7.14 5.20 6.07 

3.48 3.69 5.91 5.69 4.78 4.17 6.82 4.56 6.52 
23-28 2.25 3.25 7.50 6.00 5.00 3.75 6.25 5.75 5.50 

NEGRO STUDENTS 
Age SM FL SD Imp. RC RO FJ Inh. ID CI VP 
10-16 2.60 3.40 4.20 4.80 5.40 6.00 5.40 7.00 7.20 6.80 2.00 6.00 
17-19 2.35 5.74 3.87 4.65 5.49 6.69 5.17 6.09 5.26 5.65 2.69 6.22 
20-22 5.15 6.30 4.10 5.00 4.85 6.10 5.00 5.95 4.50 6.95 4.00 5.40 
23-28 3.62 5.58 4.31 3.24 5.55 6.48 5.31 4.55 5.52 5.34 2.76 5.79 


For number of subjects in each group, and grades represented, 
see Table 3. 


Figure IV. 
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of sophomores in the white college group, the distribution of stu- 
dents is approximately the same in both races. Table III shows the 
distribution by age and grades. From inspection of this table, 
it is seen at once that the negro student is from two to three 
years older than the white student of the sarmhe grade. This 
retardation may be caused by adverse environment or by inferior 
intelligence of the negro. Probably both factors are at work. To 
eliminate as much as possible the influence of age on the scores 
made by the groups, the subjects were divided into four sections, 
the first section containing those 16 years of age and younger, 
the second section containing those from 17 through 19, the third 
those from 20 through 22, and the fourth those 23 and above. 
Table IV gives the mean scores made by each section in each 
trait and Figure 4 shows the profiles plotted from these data. It 
will be seen from inspection that, as the subjec%s grow older, there 
is a tendency for increased scores in speed of movement, fiexi- 
bility, reaction to contradiction, and co-ordination of impulses; a 
tendency for decreased scores in speed of decision, finality of 
judgment, motor inhibition, and perhaps volitional persevera- 
tion; the scores for motor impulsion tend to remain about the 
same; in freedom from load, the white scores remain about the 
same, or decrease slightly, while the negro scores increase with 
age; in resist: opposition, the white scores increase, and the 
negro scores 1? about the same; in interest in detail, the 
white scores decrease slightly, while the negro scores vary. The 
increase in speed of movement may be due to increased use of writ- 
ing, the necessity for rapid note-taking, or the influence of other 
situations in every-day school life that call for speed in writing. 
The increase in the three other traits noted would probably be due 
to a deeper realization of problems as such, and the consequent 
tendency to eliminate “snap judgments”. The decrease in motor 
inhibition and volitional perseveration may be due to the greater 
value which the older person puts upon time, or, in the case of 
motor inhibition, the decrease of eontent which the test-phrase 
holds for him, and the consequent increase in difficulty of the 
test. A person’s rating in motor impulsion is probably a 
fundamental part of his make-up, and would hardly be expected 
to change in either direction. A true race difference may be 
indicated by the difference in the change of scores for freedom 
from load, in which those for the whites remained nearly constant, 
and those for the negroes increased. The negro score in this 
trait never increased enough, however, to bring it up to the lowest 
score of the whites. In such a trait as resistance to opposition, 
the increase in white scores, while the negro scores remain nearly 
constant, may indicate an earlier maturity on the part of the 
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negro. The decrease in scores for the white students in interest 
in detail may indicate, as noted above, an increased valuation of 
time. 

In order to check the above classification of the groups into 
age-sections, another classification was made, this time into grade 
sections. Here it is evident, the factors of intelligence and greater 
capability come into play, and we may be fairly sure that the 
higher the grade, the higher the intelligence level, since there 
is a selective factor at work to eliminate those of less native 
capacity. Table V gives the mean scores attained by the various 


TABLE V. 


Mean scores by school grades of subjects. 
WHITE STUDENTS 
Grade FL F SD Imp. RC RO FJ Inh. ID CI VP 
HS 1-2 3.56 6.46 3.93 2.76 3.70 5.60 5.50 5.36 4.36 6.16 3.90 5.66 
HS 3-4 6.25 7.62 4.87 4.87 4.00 5.12 5.50 6.50 5.50 7.50 3.62 5.85 
C 1-2 7.04 5.74 5.04 4.00 4.33 6.89 5.74 4.88 6.96 7.22 6.44 6.58 
C 3-4 6.08 7.66 5.41 4.33 4.91 4.83 5.83 5.00 4.17 7.58 5.08 6.75 
NEGRO STUDENTS 
Grade FL F SD Imp. RC RO FJ Inh. ID CI VP 
HS 1-2 2.09 5.09 3.38 4.33 5.57 6.47 5.00 6.81 6.38 4.91 2.33 5.71 
HS 3-4 4.06 6.82 4.23 5.18 5.35 7.23 5.12 5.65 7.23 7.29 1.88 5.88 
C 1-2 4.40 5.25 4.05 4.75 4.95 5.85 5.65 5.20 5.45 6.15 3.85 5.30 
C 3-4 5.79 3.95 2.95 4.84 6.37 5.42 4.21 3.79 7.58 3.89 6.10 
Average ages of above groups: 
HS 1-2 HS 3-4 C 1-2 C 3-4 
Whites . 17.0 19.74 20.33 
Negroes 17.66 21.59 21.2 25.58 


For distribution of ages, and numbers in each group, see Table 3. 


sections, and Figure 5 shows the profiles ‘plotted from the data. 
In this case, with increase in grade, there is an increase in the 
scores of more traits than when the student’s increase is in age. 
Inspection shows that there are increases in the scores for speed 
of movement, flexibility, resistance to opposition, interest in detail, 
co-ordination of impulses and volitional perseveration. Finality 
of judgment is the only trait in which there is a decrease. The 
scores for freedom from load, reaction to contradiction, and motor 
inhibition remain about the same, or vary irregularly. There is 
a slight increase in the scores for speed of decision and motor 
impulsion for the whites, with a decrease in these traits for the 
negroes. With increased intelligence and stronger personality, one 
would naturally expect higher scores. For finality of judgment, 
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the decrease is from above “5” to about “5” which is the normal 
score. The advanced students seem to be less addicted to hasty 
revision. The constancy of the traits freedom from load, reaction 
to contradiction, and motor inhibition, may indicate that normal 
scores in these traits are sufficient for proficiency’in school work, 


Figure V 
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although it would seem that the advanced student would gain in 
resistance to contradiction. The opposite direction in which the 
scores tend, for the two races, in speed of decision and motor 
impulsion is probably indicative of racial characteristics, and may 
offer good returns for further investigation. 

Since, as statedsin the first part of this section, a study of mean 
or median scores offers little that is definite in the way of racial 
temperament patterns, the scores for the college students were 
gone over in an attempt to list any tendency toward groupings of 
the high scores found there. Following Professor Downey in the 
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main, we may suggest that there are three types of such group- 
ings—those in the traits listed as the first four characterizing one 
as mobile or rapid-fire in organization, as the second four charac- 
terizing him as aggressive or forceful, and as the third four the 
controlled, deliberate, careful type. Also, there may be combina- 
tions of any two groupings, or of none at all—the profile may 
be balanced or else very irregular. The method followed here 
in listing tendencies was as follows: The first four scores in 
each individual’s case were added together, as were the second 
and third sets of fours. These sums were compared, and when 
the sum of one group exceeded the sum of either of the other 
groups, the individual was listed as being characterized by the 
tendency correlated with that group. Where no one group ex- 
ceeded either of the others by three or more points, there were 
two possibilities—either that the individual had a well balanced 
profile, or else there was a combination of two high groups and 
one low one. If the first was the case, he was listed as “Balanced”; 
if the second was the case, he was listed under the combination 


TABLE VI 


Showing distribution of college students according to the group of 
traits in which each received his highest scores. 
1st 2nd 3rd 1st and 2nd 2nd and 3rd ist and 3rd Balanced 


Whites ..4 6 14 1 6 5 3 
Negroes 2 9 8 4 8 3 5 


of his two high groups. Table VI gives the results of this attempt 
to classify the patterns. From this table we may see that the 
whites slightly surpass the negroes in the number of mobile, rapid- 
fire individuals; they have a clear superiority in the number of 
controlled, deliberate, careful persons; and they slightly surpass 
the negroes in the number of individuals showing a combination 
of these two characteristics. The negroes slightly surpass the 
whites in the number of aggressive persons and in the number 
of persons combining quickness and mobility with aggressiveness, 
and also in the number of individuals combining aggressiveness 
and deliberation. 


As a last point, Professor Downey suggests that, while there is 
not as much significance attached to the total score of each individual 
as to his scores on various traits, still, a hig total score is, in 
general, indicative of a “stronger personality,” and a low total 
score is indicative of a weaker personality. On this assumption, a 
tabulation of total scores of the college group was made, and*a 
distribution curve was drawn from the results. See Figure 6. 
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FIGURE 6. 


Distribution of total scores of College students. 
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Mean Median 
White 68.08 70 
Negro 59.05 57 


Six negroes (15.4%) passed the white median. 


It was found that the white college students surpassed the negro 
college students in their totals, which, therefore, would indicate, 
that, in general, the whites have “greater force of personality.” 
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= SUMMARY 


1. In general, the negro is slower in movement than the white, 
has about the same or slightly greater load or inertia; has slightly 
less flexibility; is slightly quicker in making his decisions; his 
motor impulsion is slightly greater; reacts more firmly against 
contradiction; but offers less resistance to physical opposition; 
takes about the same or slightly less time in reconsidering his 
decisions; the negro college student has less motor inhibition, the 
negro high school student has greater, compared with white stu- 
dents; the negro has practically the same interest in detail as the 
white, has markedly less ability in co-ordinating his impulses; and 
has a slightly less tendency to volitional perseveration. 


2. There is little difference in variability, but the negro college 
student has greater variability on seven of the traits, as opposed 
to the greater variability of the whites in five, and the amount of 
the negroes’ variability is slightly greater. 


3. In general, the negro college and high school student is 
from two to three years older than the white student of the same 
academic grade. 


4. As to the influence of age (from early adolescence to the 
middle twenties, as the subject, approaches maturity, there is a 
tendency for increased scores in speed of movement, flexibility, 
reaction to contradiction, and co-ordination of impulses; a tendency 
for decreased scores in speed of decision, finality of judgment 
motor inhibition, and volitional perseveration, the scores for motor 
impulsion tend to remain about the same; in freedom from load, 
the white scores remain about the same, or decrease slightly, and 
the negro scores increase with age; in resistance to opposition, the 
white scores increase and the negro scores remain about the same; 
in interest in detail, the white scores decrease, while the negro 
scores vary. : 


5. As to the analysis of mean scores by grades (probably in- 
fluenced by the factor of increasing intelligence in direct ratio to 
the higher grades): the scores for speed of movement, flexibility, 
resistance to opposition, interest in detail, co-ordination of im- 
pulses, and volitional perseveration increase in the higher grades; 
the scores for freedom from load, reaction to contradiction, and 
motor inhibition vary; there is a slight increase in the scores for 
speed of decision and motor impulsion for the whites, with a de- 
crease in these traits for the negroes, from the lower to the higher 
grades. 

6. In predominance of temperamental patterns, the whites 
slightly surpass the negroes in the number of mobile, rapid-fire 
individuals; they have a clear superiority in the number of con- 
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trolled, deliberate, careful persons; and they slightly surpass the 
negroes in the number of individuals showing a combination of 
these two characteristics. The negroes slightly surpass the whites , 
in the number of aggressive persons and in the number of indi- 
viduals combining quickness and mobility with aggressiveness, and 
also in the number of individuals combining aggressiveness and 
deliberation. 


7. In “strength of personality,” as indicated by the total score 
of each individual, the whites surpass the negroes, only 15.4 per 
cent of the negroes exceeding the median of the whites. 
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A STUDY OF THE DATA ON THE RESULTS GATH- 
ERED FROM REPEATED MENTAL EXAMINA- 
TIONS OF 200 DEFECTIVE CHILDREN 
ATTENDING SPECIAL SCHOOLS 
OVER A PERIOD OF EIGHT 
YEARS 
By Meta ANDERSON 
Director Binet Classes, Newark, N. J. 


Although the work for defectives was begun in Newark in Sep- 
tember, 1910, it was not until 1913 that the organization of the 
work permitted anything like systematic re-examination of the 
children admitted to the special schools and even then complete 
yearly re-examination of the whole special school population was 
not accomplished. However, since 1916 it has been possible to 
re-examine each child in the special schools yearly excepting a 
few cases which will be noted later. The object of the work was 
to get information in regard to the growth of intelligence of 
the children in the special classes (now called Binet Classes) in 
order, if possible, to know better how to train them and to 
eliminate errors in their training and education. The object of 
this study was to organize, classify and analyZe these data in 
order to get further information in regard to types of mental 
growth curves and in order to get further information in regard 
to the value of the intelligence quotient. 

While the population of the Binet Schools is upward of five 
hundred pupils only two hundred were studied here. These two 
hundred cases vary from the grade of low grade imbecile to 
that of high grade moron. Before January, 1918, the cases were 
selected for the Binet schools because they were failures in the 
regular grades and at the same time fell more than three years 
below normal according to an intelligence test (this standard 
being set by the New Jersey law). Since January, 1918, the 
cases have been selected by the Psycho-Educational Clinic of the 
city. Each case is certified as being definitely feeble-minded 
before being sent to a Binet School. 

Each case was examined from three to eight times at not less 
than a year’s interval. The majority of the cases were re- 
examined by the writer. The others were examined by two head 
assistants in the department both of whom were unusually well 
trained to do the work. 


In a department always having a population of 500 defective 
children it would seem possible to have numbers of retests on 
at least that many children. This investigation has shown that 
the turnover in the population of the Binet Schools is very great. 
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Many of the children except the lowest. grade are assigned to 
special work late in their school career. These children are not 
presented for psychological examination until the regular schools 
have done all in their power to help them succeed in their work. 
Hence, it is only after their inability to learn has become most 
marked or their behavior disorders have become unbearable that 
these children are assigned’ to the Binet Schools. By that time 
the children are anywhere from ten to fourteen years of age. 
Since they leave school at sixteen and sometimes earlier as 
during the war when the pressure for workers was very great 
these children have short opportunity for training. This also 
renders it impossible to have a great number of retests on the 
same individual. 

A certain percentage of this group also leave school to go to 
houzes cf correction, to institutions, move from the city or are 
lost track of. Therefore the number remaining in the Binet 
Schools for more than four years is comparatively few. 

With the present wider use of group tests defective children 
ave being discovered earlier in their school career but it will be 
necessarily a number of years before sufficient data are gathered 
on this new group for a worthwhile study. 

Before 1918 the Goddard revision of the Binet tests were used, 
since 1918 the Stanford revision of the Binet tests have been 
used. 

The study falls into two parts. The first part has to do with 
the frequency of types of mental growth curves and the second 
part has to do with a study in values of the IQ. 
PART I. FREQUENCY OF TYPES OF MENTAL GROWTH 
CURVES 


OTHER INVZSTIGATIONS. 


Kuhlman has' made growth curves on data gathered from re- 
peated retests on inmates of the Minnesota institution and he 
says: “The main features of these growth curves present but 
little that is not now pretty well known. They show that the 
feeble-minded do develop mentally, and at a rate in proportion 
to the grade of intelligence. This was still a disputed point at 
the time this study was begun. This rate of development de- 
creases with age for all grades. The rate of this decrease 
cannot, of course, be shown with this data, as we do not know 
how much mental growth in terms of absolute units the mental 
year represents at different age levels.” 

Kuhlman summarizes his results on mental age growth as 
follows: 
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“1. The mental ages of the feeble-minded increase with age 
at a rate proportionate to the degree of mental deficiency. 

“2. On the whole the mental age ceases to increase between 
the ages of fifteen to eighteen, the idiot grade ceasjng develop- 
ment about three years earlier than the borderline grade. 


“3. For the cases figuring in the present study and for the 
time period they were under observation, 4.8 per cent of the 
cases gained ‘twelve months or more in mental age in a year, 68 
per cent gain from one to eleven months a year, 11 per cent 
gain-or lose less than one month a year, and 16 per cent lose 
from one to seven months a year in mental age. 

“4. The lower grades lose more frequently in mental age than 
the higher grades. 

“5. The frequency in loss in mental age increases with age, 
independently of grade.” 

Doll says on the rate of growth for subnormals: “Not much 
more is known about the rate of mental growth among extreme 
subnormals, like the feeble-minded, than about the age of arrest. 
In this regard we are even at a loss for adequate observational 
data. Experience with institutional feeble-minded, however, sug- 
gests a great variety of possible forms of retardation curves. 
Such experience points to a small but significant group of feeble- 
minded who grade ‘at age’ by the Binet scale when first ex- 
amined but who make no material progress in later examinations 
over a period of years (Type A.) There are others (Type B) who 
at the time of first examination test at age, then continue to 
develop at a retarded rate for a time and then reach a point of 
arrest. There is a third group (Type C) already retarded when 
first observed, who from the first examination to the age of final 
arrest develop at an annual rate of growth which is a con- 
stantly decreasing variable. There is also a fourth group (Type 
D) who from the time of first examination to the age of final 
arrest. There are many variants and combinations of these 
basic types for individual subjects. We believe, however, that 
these four types comprise the majority of the feeble-minded. It 
is conceivable, though not probable, that some individuals might 
have a higher than average rate before retarding. It is also con- 
ceivable, though not likely, that the arrest might be relatively 
greater in the early ages with a curve of growth which would be 
accelerating at some intervals. It remains for experimental in- 
vestigation to show the relative frequencies of these different 
types of growth curves.’” 


*Kuhimann, Results of Repeated Re-examinations of Feeble-minded (639) 
Over a Period og Ten Years. Journal of Applied Psy. Sept., 1921. 
2Doll, Growth of Intelligence. Psy. Monograph, pp. 15-19. 
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With the view of determining what light, if any, the limited 
data on hand could throw on the subject of the growth of intelli- 
genee of feeble-minded subjects 100 mental age growth curves 
of pupils attending the Binet Schools from three to eight years 
were plotted. These curves were then classified according to 
Doll’s theoretical intelligence growth curves as follows: 

Type A. (Doll) Those who test “at age” by the Binet scale 
when first examined but make no material progress in later ex- 
aminations. 


Type B. (Doll) Those who at the time of first examination test 
“at age,” then continue to develop at a retarded rate for a time 
and then reach a point of arrest. 

Type C. (Doll) Those who are already retarded at time of 
first examination but who continue to develop from the time -of 
first examination until time of final arrest at an annual rate of 
growth which is a constant*y decreasing variable. 


Type D. (Doll) Those who from the time of first examination 
to the age of final arrest develop at a constant rate up to time 
of final arrest. 

For convenience two new types were added although it is quite 
likely they could be easily included in ore of the above groups. 
They are as follows: 

Type E. (New) Those who are retarded at the time of first 
examinaticn, and who also have apparently reached the time of 
final arrest. 


Type F. (New) Those who have developed at a constant or 
variable rate up to the time of apparent arrest, but who after 
two to four years of apparent arrest, have begun to develop 
again. 


NUMBER OF CASES 


Of the 11 cases listed as type f there were 
4 who remained at an apparent arrest for 2 years. 

‘ 5 who remained at an apparent arrest for 3 years. 

2 who remained at an apparent arrest for 4 years. 
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Of these 11 cases listed as type f there were 

2 cases who continued development for 1 year after arrest. 
6 cases who continued development for 2 years after arrest. 
2 cases who continued development for 3 years after arrest. 
1 case who continued development for 4 years after arrest. 


at which time there was another apparent arrest or the children: 
left school. 


Type f cases classified according to life age 
Number of Cases 


Type f cases classified according to mental age 
Number of Cases Mental Age 


SUMMARY. 


1. It is evident that a greater number (47) of the cases studied 
in this group have developed mentally according to Doll’s theo- 
retical curve described as Type C. 


2. The next larger group (22) of cases studied have developed 
mentally according to Doll’s theoretical curve described as Type D. 
3. There were 67 cases out of the 100 studied who were re- 
tarded when first examined and who then retarded either at a 


constant rate or at a steadily decreasing rate up to age of final 
arrest. 


4. Only one (1) case could be obviously classified under 
Type A. 

5. The 19 cases which were retarded when first examined and 
which developed no further at later examinations may or may 
not belong to Type A.That is, it is possible that if they had been 
examined at earlier periods they would have tested “at age.” 
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6. The 11 cases under Type F could be~classified under Type 
C in a broad classification. They differ from that type merely 
in the fact of having long periods of arrest but in the long run 
they develop at a decreasing rate up to the time of final arrest. 


7. It would appear from the study of these 100 cases that 
the majority 80 in this study of feeble-minded cases slow up 
gradually in mentality up to time of final arrest and that com- 
paratively few develop normally for a time and then stop sud- 
denly. In this study not more than 20 of the cases could have 
been said to have come to sudden arrest and 19 out of the 20 
cases are doubtful. 

8. The 100 cases studied were all classified as feeble-minded 
and were attending Binet Schools. It is possible that feeble- 
minded cases which develop normally and then retard suddenly 
are not discovered early enough in their school careers to be 
selected for Binet Schools. 


Part II. Stupy in Some VALUES OF THE I. Q. 


From three to seven tests were made on 200 children attending 
the Binet Schools at intervals of a year in most cases. There 
are cases which are noted in which the retests were made at 
greater intervals because the children were not available for 
examination either because they had moved from the city or 
attended another school and then re-entered the Binet School 
when they were re-examined. At the time of this study there 
were only two hundred cases who had been re-examined a suffi- 
cient number of times to make the study of their cases worth 
while. These cases then were all selected because they were 
attending the Binet Schools and because they had attended long 
enough to have from two to six retests made at intervals of a 
year. 

The cases were classified and grouped. 


(a) According to life age at initial test and I. Q. in groups 
of tens. 


(b) According to I. Q. in groups of tens. 


(c) According to I. Q. and intervals of retests regardless of 
life age. 


These groupings were made in order to study the I. Q. in its. 
various aspects. 
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A. CASES CLASSIFIED ACCORDING TO LIFE AGE AT INITIAL TEST 
AND I Q—IN GROUPS OF TENS. 
No. of Cases. Life Age. Median I. Q. Number Median I. Q. 
at of at 
First Test. Retests. Last Test. 


3842 44% 
39 41 
33 44 
31 25 


ages 
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A. CASES CLASSIFIED ACCORDING TO LIFE AGE AT INITIAL TEST 
AND I Q—IN GROUPS OF TENS. (CONTINUED ) 


No. of Cases. Life Age. Median I. Q. Number Median I. Q. 


at of at 
First Test. Retests. Last Test. 
2 6 74 4 60% 
1 7 70 7 60 
2 8 75 4 65 
6 i) 74 6 48 
5 5 
8 3 
5 3 
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SUMMARY OF RESULTS. 


1. The cases with I. Q.’s from 30 to 39, inclusive, and with life 
ages from six to nine years, inclusive, showed a gain in median 
I. Q. of 2 to 11 points. 

2. The cases with I. Q.’s from 30 to 39, inclusive, and with life 
ages from 10 to 13, inclusive, showed a loss of 6 to 7 points in 
median I. Q. 

3. The cases with I. Q.’s from 40 to 49, inclusive, and with life 
ages of 6 years showed a loss of 2 points in median I. Q. 

4. The cases with I. Q.’s from 40 to 49, inclusive, and with 
life ages from 8 to 11, inclusive, showed a gain in median I. Q. 
of 2 to 4 points. 

5. The cases with I. Q.’s from 40 to 49, inclusive, and with the 
life ages of 12 showed a loss in median I. Q. of 5 points. 

6. The cases with I. Q.’s from 50 to 59, inclusive, and with life 
ages from 6 to 12, inclusive, showed a loss in median I. Q. of 
nothing to 9 points. 

7. The cases with I. Q.’s from 60 to 69, inclusive, and with life 
ages from 6 to 13, inclusive, showed a loss in median I. Q. of 7 to 

2. The cases with I. Q.’s between 40 and 49 showed a loss of 1 
12 points. 

8. The cases with I. Q.’s from 70 to 79, inclusive, and with life 
ages from 6 to 12, inclusive, showed a loss in median I. Q. of 10 
to 26 points. 

9. In other words, the younger cases with I. Q.’s from 30 to 39. 
inclusive, gained in median I. Q.; the older cases with I. Q.’s from 
30 to 39, inelusive, lost in median I. Q. 

10. With the exception of the 6-year-olds who lost 2 points in 
median I. Q. the younger cases with I. Q.’s from 40 to 49, inclusive, 
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gained in median I. Q.; the older cases with I. Q.’s from 40 to 49, 
inclusive lost in median I. Q.’s. 


11. All cases of any age with I. Q.’s from 50 to 80, inclusive 
showed a loss in median I. Q. 


B. CASES CLASSIFIED ACCORDING TO IQ 


No. of I.Q. Range Median I.Q.’s at Successive Examinations. 


Cases 1 2 3 4 5 6 
14 30-39 35 33 31 30 37 
29 40-49 47 49 46° 48 46 
63 50-59 56 56 54 51? 50 48 
59 60-69 65 60 60 58 60 57 
33 70-79 73 67 62 60 58 56 
SUMMARY. 


1. The cases with I. Q.’s between 30 and 39 showed a gain of 2 
points in median I. Q. at the 5th re-examination. 
point in median I. Q. at the 5th re-examination. 


3. The cases with I. Q.’s between 50 and 59 showed a loss in 
median I. Q. of 8 points at the 6th re-examination. 


4. The cases with I. Q.’s between 60 and 69 showed a loss in 
median I. Q. of 8 points at the 6th re-examination. 


5. The cases with I. Q.’s between 70 and 79 showed a loss in 
median I. Q. of 17 points at the 6th re-examination. 


6. All cases with I. Q.’s below 50 remained practically sta- 
tionary with a gain or loss of but 1 or 2 points in median I. Q. 


7. The cases with I. Q.’s between 70 and 80 showed loss. The 
highest initial I. Q. showed the greatest loss in median I. Q. at 
the final re-examination. 
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CASES CLASSIFIED ACCORDING TO IQ AND ALSO ACCORDING 
TO INTERVALS OF RETESTS. 


IQ Median IQ’s at initial test and at intervals of retests 
cases range 


1 
30-39 34-32° 
30-39 
30-39 
30-39 
30-39 
40-49 
40-49 
40-49 
40-49 


46-48 


50-59 
50-59 
50-59 
50-59 
50-59 
50-59 
60-69 
60-69 
60-69 
60-69 
60-69 
70-79 
70-79 
70-79 
70-79 
70-79 


56-56 


65-61 


73-67 


Range of IQ. 


30-39 
40-49 
50-59 
60-69 
70-79 


46-45° 


55-53 


65-60 


73-62 


Points lost at successive examinations. 


SUMMARY. 


All median IQ’s in all ranges showed a loss as follows: 
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Compared. 
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In each group of cases the greater loss in median IQ occurred 
at the end of the fifth examination. 

In each group of cases the greatest loss in median IQ occurred 
in the groups of the higher initial 1Q’s. 


CONCLUSIONS. 


1. A great majority of the cases who are examined as candi- 
dates for the Binet Schools are retarded at the time of first ex- 
amination and then retard either constantly or at a decreasing 
ratio up to time of final arrest. 


2. Comparatively few of the cases examined as candidates for 
the Binet Schools develop normally up to a certain age of arrest 
and then remain at arrest. 


8. In cases whose Intelligence Quotient is below 50 there is a 
tendency for the median IQ to remain constant or to gain or lose 
slightly. The median IQ shows a gain among the younger cases. 

4. All cases with IQ’s between 50 and 80 show a decrease in 
median IQ at any age. The highest initial IQ’s show the greatest 
loss in median IQ after repeated examinations. 

5. In the case of examinations at varying intervals the greatest 
loss in median IQ was at the interval of five years which was the 
last examination for all grades of cases. The greatest loss in 
median IQ was shown by the highest grade of cases. 

6. The IQ is variable in individual cases. 

7. No group of cases showed a marked increase in median IQ. 

8. These conclusions are merely for cases examined as can- 
didates for Binet Classes. It is possible that repeated examinations 


on all retarded cases in a school system might give different re- 
sults. 
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THE DOWNEY WILL-TEMPERAMENT GROUP TEST; 
A FURTHER ANALYSIS OF ITS RELIABILITY 
AND VALIDITY 


By G. M. RucH and M. C. Det Manzo, 
University of Iowa. 


This paper presents the results of a statistical analysis of the 
scores of one hundred and forty-six high school students tested 
with the Downey Group Will-Temperament Test. The discussion 
of these results will be divided into four sections in accordance 
with the specific objectives of the investigation. 


I Tue ReviaBiLiry oF THE TEST AS A WHOLE 


Since the Downey tests are published in only a single form the 
usual method of obtaining reliability coefficients by correlating 
equivalent forms is not applicable here. Moreover, the common 
practice of breaking tests into chance halves for purposes of ob- 
taining the self-correlation can only be carried out with certain 
qualifications as to the justice of this method. Im an earlier pa- 
per’, one of the authors has argued that the correlation of the 
sums of the odd-numbered tests against the sums of the even- 
numbered tests might fairly provide an approximation to a valid 
coefficient of reliability. A few words in explanation of the nature 
of the Downey Will-Temperament Tests will help to make clear 
the limitations of the method of correlating chance halves. 

The scale is composed of twelve separate tests each designed to 
measure one aspect of the total volitional pattern. These are: 


A Speed of Movement (VI-1) 

B_ Freedom from Load (VI-1, 2) 

C Flexibility (VIII) 

D_ Speed of Decision (I) 

E Motor Impulsion (X) 

Self-Confidence (XI) 

G Non-compliance (XII) 

H_ Finality of Judgment (XIII) 

I Motor Inhibition (VII) 

J Interest in Detail (IX) 

K Co-ordination of Impulses (V) 

Volitional Perseveration (VIII-2) 
> ‘Ruch, G. M.: “A Preliminary Study of the Correlations between Bati- 


mates of Volitional Traits and the Results from the Downey ‘Will-Profiles’,” 


Journal of Applied Psychology, 1921, 5, 159-162. 
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This arrangement of the twelve traits into three groups as here 
indicated follows the purpose of the author of the test in that 
traits A to D belong to a common “pattern”; traits E to H belong 
to a second “pattern”; and traits I to L belong to a third “pat- 
tern.” The groupings are intended to throw together the tests 
that are intended to reveal, respectively, three main types of voli- 
tional temperaments known, in order, as: 


I “Hair-trigger” organization, or the explosive type of will de- 
scribed by James (Tests A, B, C, and D). 


II “Willful, aggressive pattern” (Tests E, F, G, and H). 
III “Slow, accurate, tenacious type” (Tests I, J, K, and L). 


The grouping and characterizations of the author of the tests 
is adhered to here strictly. 


Since the scale is of the nature of a diagnostic picture in which 
certain groups (the three patterns) of tests can logically yield 
uniformly high scores while other portions can as logically yield 
very low seores (or at least much lower scores), it follows that the 
total score on the whole scale will have very little meaning. It 
also follows that any attempt to split the total scale into chance 
halves must take care that equal numbers of tests be taken from 
each of the three diagnostic patterns in forming such chance 
halves. If this is done properly we would then have two half- 
scales; each half-scale being composed of two tests from each of 
the three patterns or a total of six tests in each half scale. Such 
chance halves could be made up in this particular case in several 
way, viz., (1) the grouping of all of the odd-numbered tests in 
one half-scale and all of the even-numbered tests in the other 
half-scale, (2) the grouping of the first two tests of every pat- 
tern into one of the half-scales and the last two tests of each 
pattern into the other half-scale, or, (3) the grouping of the first 
and last tests of each pattern into one half-scale and the second 
and third tests of each pattern into the second half-scale. 

In the paper mentioned above, the reliability of the Downey 
tests (individual form) was studied only by the first of these three 
methods. In the present report, the reliability coefficients have 
been determined by all of the three possible ways and the results 
averaged. Table 1 shows these results. The notations as “Method 
1,” “Method 2,” and “Method 38” refer to the above-mentioned 
ways of obtaining chance halves. The notation “r 44” means 
that the coefficient of correlation stated was obtained by correlat- 
ing half (six tests) of the scale with the remaining half (six 
tests) of the scale. The notation “r..” is the porphecied correla- 
tion which would be obtained if two similar whole forms were cor- 
related. The values of the “r.” column were calculated by the 
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use of Brown’s formula, 
r— The ages of the 
1+(n-1) r 


subjects cover a range of from ten to twenty-nine, years. Grades 
seven to twelve are involved. 


TABLE 1 
RELIABILITY COEFFICIENTS 


Method r 33 Tis N 


(Brown’s formula) 
I 0.17 0.29 146 
II 0.31 0.47 146 
III 0.24 0.38 146 
Average 0.24 0.38 146 


The probable errors of these coefficients (Pearson product-mo- 
ment formula) range between 0.05 and 0.06. Table 7 shows the 
probable errors of coefficients of correlation for the various values 
of r reported in this study. 


In view of the wide range of talent represented in the group 
tested, these reliability coefficients cannot be considered to be very 
satisfactory. They are, however, much higher than those pub- 
lished in the earlier paper cited; the range of talent in the former 
group being very much more curtailed. 

There is another possibility which must be considered in accept- 
ing these reliability coefficients at their face value. According to 
our original assumptions, the reliability of the scale as a whole 
can only be obtained by the methods we have used in case the 
tests are correctly grouped into patterns in a proper manner. The 
low reliability which we have found might mean merely that the 
tests are improperly classified at present. In this case, the true 
reliability of the scale (properly arranged) might be markedly 
higher than we have reported. Discussion of this question of the 
grouping of the separate tests should be postponed until the inter- 
correlations of the tests have been presented. 


Il Tue INTERNAL COHERENCE OF THE SCALE 


One of the crucial issues in the evaluation of the Will-Tempera- 
ment Test is the matter of the arrangement of the separate ele- 
ments into the diagnostic patterns. As the authors interpret Dr. 
Downey’s classification of the individual tests into groups which 
picture volitional patterns, it would seem that a valid grouping 
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would imply certain mutual relationships among the tests. These 
are: 


(1) High scores in one test of a pattern would be paralleled with 
high scores in the other three tests of the same pattern. This 
is equivalent to saying that the inter-correlations between 
tests of a given pattern would tend to be positive and high. 

(2) Correlations between each test and the composites of the 
tests of the same pattern would tend to be positive and high. 

(8) The correlations between tests of one pattern and those of 
the other patterns would tend to be zero or very low. 


(4) Correlations between single tests and the total score on the 
scale would be low in all probability and not significant for 
purposes of evaluating the tests. 


In order to test these assumptions about the proper grouping 
of the tests of the present scale, the correlations of each test with 
the composite (sum) of the other three tests of the same pattern 
have been computed. For obvious reasons the test used as one of 
the variables in the correlation must be omitted from the com- 
posite. Table 2 shows the obtained correlations. 


TABLE 2 
CORRELATIONS OF THE TESTS WITH THE COMPOSITES 
Test Composite 
A B+C+D 0.54 
B A+C+D 0.28 
C A+B+D 0.10 
A+B+C 0.39 
E F+G+H —0.08 
F E+G+H —0.23 
G 1+F+H -0.13 
H E+F+G —0.09 
I J+K+L 0.07 
J I+K+L 0.33 
K I+J+L 0.07 
L I+J+K 0.05 


The average inter-correlations of the tests within a pattern is 
probably even a better method of testing the general validity of 


the grouping into patterns. Table 3 shows the results yielded by 
this method. 


WILL-TEMPERAMENT GROUP TEST 


TABLE 3 
INTER-CORRELATIONS OF THE TESTS OF THE SAME PATTERNS 
Variable 1 Variable 2 a 

0.36 

0.28 

0.37 

—0.05 

0.33 


0.05 
Average 0.22 


Pp 
SV aga 


—0.05 

0.00 
—0.02 
--0.16 
—0.15 

0.01 
Average —0.06 


ns 


I J 0.08 
I K 0.12 
I L 0.00 
J K 0.24 
J L 0.18 
K L -0.18 
Average 0.07 


General inspection of Tables 2 and 3 shows that the correlations 
between the separate tests are uniformly low, the second pattern 
(Tests E to H) being the most unsatisfactory. It would be inter- 
esting to pick out from each pattern the best tests if this could be 
done easily. This would properly be done by multiple correlation 
methods and the labor involved would not seem to be justified by 
the general promise of the coefficients reported. In pattern one, 
it would appear that Tests A and D would probably be found to 
be considerably superior to the other two. In pattern two, there 
is no promise that multiple correlation methods would reveal any- 
thing of significance. In pattern three, Tests J and K would ap- 
pear the most promising. 
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III CorRELATION OF THE WILL Traits WitTH GENERAL 
INTELLIGENCE 


Table 4 shows the correlations obtained between the scores in 
the twelve separate will tests and the averages of two standard 
group tests. The latter are the Terman Group Test of Mental 
Ability and the Morgan Group Mental Test. The reliabilities of 
each of these tests is known to be well above 0.90 for ranges of 
talent comparable with those represented in the group considered 
here. The significance of the coefficients of correlations of Table 
4, like those of all of the preceding tables, is obscured by the 
fact that the reliabilities of the separate tests are unknown and 
are probably very low in many cases. 


TABLE 4 


CORRELATIONS OF THE TWELVE WILL TRAITS WITH MEASURES 
OF GENERAL INTELLIGENCE 
Test r 
0.40 
0.19 
0.20 
0.16 


—0.03 
0.20 
0.11 
0.14 


0.16 
0.33 
0.34 
-0.05 


rAS 


None of the above correlations is high enough to lend any sup- 
port to the possible criticism of the Downey tests as being meas- 
ures of general intelligence. The factor of intelligence undoubt- 
edly does enter into some of the tests but not to an extent that 
would invalidate the scale on this score. 


The inter-correlations between the tests have also been deter- 
mined independently of the influence of intelligence by means of the 
usual methods of partial correlation. Table 5 shows such partial 
coefficients. The notations “TAB.i”, “TAC.i”, ete., read: The cor- 
relation between traits A and B which is independent of the in- 
fluence of intelligence, and similarly, for the correlation between 
traits A and C, and the others. 


WILL-TEMPERAMENT GROUP TEST 


TABLE 5 


THE CORRELATIONS BETWEEN THE SEPARATE TRAITS WHICH 
EXISTS INDEPENDENTLY OF THE FACTOR OF INTELLIGENCE 
(OBTAINED BY PARTIAL CORRELATIONS) 


"AB.i 0.33 ‘'EFi 0.04. (0.08 
"ACA 0.22 0.00 IKi 0.07 
‘AD.i 0.34 -002 0.01 
"BC. -0.09 ‘FGi -017 JKi 0.14 
"BD.i 0.31 FHi -016 JLi 0.21 
‘CD.i 0.02 0.00 KLi  -0.18 


By referring to Table 4 again it will be seen that there has been 
little change resulting from the use of partial correlations in elim- 
inating the influence of the factor of intelligence. It may be con- 
cluded that whatever inter-correlation does exists between the 
various separate tests of the same patterns is not produced by 
the common factor of intelligence. The formula used in partial 
correlation is the usual one, following Yule’s notations: 


IV Tue VALipity oF THE Downey SCALE 


The validation of any test of psychology capacities is at best 
a most difficult task. To this rule the Downey scale is probably 
no exception. For an account of the derivation of these tests the 
reader is referred to the manual for use with the Will-Tempera- 
ment. Seale’. 

;,In the earlier paper to which reference has several times been 
uate. it was attempted to secure ratings from competent judges 
acquainted with the subjects on each of the twelve traits of the 
Downey scale. A similar procedure has been adopted in the pres- 
ent study with certain modifications. It was thought that the 
teachers of the high school pupils under study might be able to 
furnish” a set of ratings on the twelve traits which would have 


June, E:..“The Will- Profile.” University of Wyoming Bulletin, 
Vol XVI, No. 4b, November, 19 1919. 
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value as a criterion against which to evaluate the tests proper. In 
adopting this plan, the authors were by no means ignorant of the 
many pitfalls in subjective judgments. As shown in certain 
of the published studies, it is possible, at times, with sufficient 
care, to obtain reliable subjective estimates in the sense of high 
self-correlation among the judges. However, high reliability, al- 
though admittedly highly desirable for such purposes, does not 
guarantee that a valid criterion has been built up. The question 
of the validity of such criteria is by far the more serious issue. 
The fact that ten persons rating a group of individuals for “‘ini- 
tiative,” “leadership,” or “tact” agree with a reliability equal to 
0.75 or higher means merely that they are rating the same thing 
or some other thing highly correlated with this thing, but it does 
not follow of necessity that the ratings correspond to the psycho- 
logical functions which by hypothesis underlie the thing being 
rated. Teachers can furnish ratings on their pupils upon such 
traits as “persistence,” “conscientiousness,” “originality,” “hon- 
esty,” and a host of other traits with a fair amount of reliability, 
yet careful statistical analysis will reveal that little else is in- 
volved in these various separate trait ratings than the teachers’ 
general estimates of the quality of school work being done by these 
pupils. In other words, almost as good results could be obtained 


by asking for estimates of school work and then using these, in 
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turn, as estimates of “persistence,” “originality,” “conscientious- 
ness,” ete. The estimates, then, are clearly not valid except as 
these traits are genuinely correlated with the quality of school 
work and even then it is a very indirect type of measurement. 

On the other hand what conclusions can be drawn from the 
situation of judges not being able to agree upon the presence or 
absence of quantitative statement of psychological traits in a 
group of individuals known to the judges? Does failure to show 
self-eorrelation in chance halves of these sets of estimates carry 
with it any important implication? Does such a negative result 
with respect to a psychological trait prove that the trait in ques- 
tion has no true psychological basis? No universal answer can be 
given to these queries. In the case of many reputed psychological 
rubrics it probably does mean that there is no real evidence for 
assuming the reality of such traits as true mental functions. In 
other cases if not in most the answer is left in an undeterminate 
status, i.e., the experimental results are entirely non-committal in 
the sense that they furnish no evidence, positive or negative, upon 
the issue of the existence or non-existence of the trait in question. 
Even at this it might for all practical purposes be argued that 
unless competent persons are in fair agreement about the presence 
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or absence of reputed psychological characteristics there is no 
direct way of identifying such traits for purposes of study. 

With these limitations of the method of estimates in mind we 
can return to the discussion of the immediate problem. In the 
attempt to build up a criterion for purposes of correlation of the 
trait scores with a second measure of the same traits the high 
school teachers of the pupils in our present study were asked 
simply to nominate the ten pupils in their classes who possessed 
the “strongest will-power” and the ten who had the “weakest will- 
power.” No attempt at further definition of these directions was 
made. The reactions expected were those of the man on the street. 
The only outcome which was expected was that the pooled nomi- 
nations might reveal two groups widely spaced upon a scale of 
“will-power” in the common meaning of that term. When the 
nomination blanks were received it was found that certain pupils 
were invariably reported in the same category. A larger number 
were about as often placed in one group as the other and a stil 
larger number were never once nominated for eithér classification. 
There was, then, evidence of marked agreement in many cases, 
the total number of complete agreements being roughly 20 per 
cent of the total of one hundred forty-six pupils in the high school 
under study. Twenty-eight pupils involving equal numbers of the 
sexes and equal numbers in each of the two categories (i.e., “strong- 
willed” and “weak-willed”’) were selected for further study. The 
agreement was perfect or practically perfect in these cases in the 
sense that either three or four of the four teachers acquainted 
with every pupil placed him in the same category and no teacher 
gave a dissenting vote in any of these cases. The reliability coeffi- 
cient for this final grcup was not determined but its general na- 
ture is assuredly high by virtue of the method of forming the group. 
Instructions were then sent to every teacher who had nominated 
any of the pupils in the final group to rate each pupil upon each 
of the twelve traits of the Downey scale. The trait descriptions 
were taken verbatim from the author’s manual for use with these 
tests. 


The final group therefore comprised (to the best knowledge of 
all of the teachers of the school) the fourteen pupils of the school 
showing the most marked volitional characteristics and an equal 
number most lacking in these same characteristics. As well as 
could be done by estimation we had segregated the highest and 
lowest ten per cents of the group as a whole. Table 6 shows the 
correlations found between the pooled estimates of the teachers 
and the test scores in each of the twelve traits. Inspection of 
these coefficients of correlation does not reveal very definite evi- 
dence of the ability of teachers to jdentify the various traits tested 
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by the Downey scale. Whether this situation can be interpreted 
as meaning that the Downey scale is of doubtful validity is very 
questionable. A saner conclusion would be that teachers are un- 
able to identify the traits of the Downey scale in their pupils to 
any considerable extent (traits J and K, and possibly traits A 
and C being apparent exceptions). It is probably fair to assume, 
however, that the Downey tests cannot be validated by the method 
of subjective estimates with any high amount of assurance. The 
correlations between estimates and test scores are in general of 
about the same magnitudes as those reported in the earlier study 
(where the range was from —0.33 to 0.51 for the pooled estimates 
of three or four college professors; the average being 0.07). The 
average estimate-test correlation is 0.15 for Table 6. 


TABLE 6 


CORRELATIONS BETWEEN ESTIMATES AND TEST SCORES (N=28) 

Trait r Trait r Trait r 
A 0.37 E 0.19 I —0.09 
B 0.02 F -0.03 J 0.51 
Cc 0.30 G -0.04 K 0.53 


D -0.29 H 0.17 L 0.17 


The probable errors of the coefficients of correlation in Table 6 
can be found in Table 7 to a sufficient accuracy for our purposes. 
The same is true of the correlation coefficients given in Tables 1 
to 5. Except in Table 6, the numbers of cases have invariably been 
one hundred and forty-six. 


TABLE 7 
A REFERENCE TABLE OF PROBABLE ERRORS OF PEARSON CO- 
EFFICIENTS OF CORRELATION FOR VALUES OF r 
APPEARING IN THE FOREGOING TABLES °- 

Probable Erors of r 
N=146 N=28 

.056 .127° 

.056 127 

.055 .126 

.055 

054 122 

.052 .120 

.051 A116 
049. 112 

.047 107 

.045 .102 

088 .989 
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CONCLUSIONS. 


The following points are suggested as summarizing the more im- 
portant findings presented: 


(1) The reliability of the Downey Group Will-Temperament Test 
is probably not higher than 0.40 as an estimate of the corre- 
lation which would be found between two similar forms of the 
tests were a second form available. It is no doubt true that 
certain of the patterns are more reliable than others, a fact 
which militates against the reliability of the scale as a whole. 


The validity of certain of the individual tests is doubtful. The 
inter-correlations in patterns II and III are low, pattern I 
being the most satisfactory. The same situation holds for 
the correlations of the individual tests with the composites of 
the remaining tests of the same pattern. 


(3) The lack of inter-correlation between tests of the same pat- 
tern in the case of pattern II, and to a less degree in the 
other patterns, may be due to either: (a) unreliability of the 
individual tests (not determined in this study), or (b) faulty 
location of certain tests. In the latter case further study 
might suggest changes from one pattern into another. 

(4) There was found no reason to believe that the Downey tests 
measure merely intelligence. The inter-correlations, where 
they are significally high, are not due to the common factor 
of intelligence. 

The attempt to validate the separate tests of the scale by 
means of teachers’ judgments yielded ambiguous results. It 
would appear that strict validation of the Downey tests by the 
method of estimates would be a very difficult task. We can 
conclude that, with a few exceptions, high school teachers 
cannot identify the psychological traits measured by the 
Downey tests by study of the characterizations of those traits 
which have been published by the author of the tests. It 
should be pointed out that the estimates used in this study 
cannot be assumed to be valid, although their general relia- 
bility is doubtless satisfactorily high for purposes of a rough 
criterion. The question of the validation of the Downey tests 
can better be dismissed by refering the reader to Dr. Downey’s. 
manual, previously cited, for a discussion of this point. 

In general there are a considerable number of tests in the 
Downey Will-Temperament Test that show promise of lending 
themselves to further study and development. Duplication of 
such tests and probably a number of substitutions will lead to 
a more valid instrument for the measurement of volitional 
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traits. The urgent need for such measurements demands that 
the pioneer efforts of Dr. Downey be rewarded with the rec- 
ognition that much promising ground had been broken by her 
patient investigations and that further progress has been 
made possible in this direction. 
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THE FRESHMAN: THORNDIKE. COLLEGE EN- 
TRANCE TESTS, FIRST SEMESTER 
GRADES, BINET TESTS 


By W. T. Root, 
University of Pittsburgh. 


INTRODUCTION. At the beginning of the academic year, 
1921-22, the College of the University of Pittsburgh* gave the 
Thorndike Intelligence Examination for High School Graduates 
to a Freshman class of 569. The tests given were: Part I, Form 
| (designated as T1); Part I, Form M (T2); Part II, Form D 
(T3); Part III, Form DP (T4). The total, that is the sum of the 
four tests, is designated T5. The test was given without inter- 
mission other than the short intervals provided for in the in- 
structions. 


Results. The following averages were secured: 


Test Average 
Tl 105.119 
T2 105.276 
T3 101.504 
T4 32.97 


T5 


341.61 


Final score (T5 divided by 5) 68.38 
Median score, 70 
Range 21 to 110 


Correlations between different forms. The forms were correlated 
with the total singly and in combination: 


Test r (Pearson) P. 
T5 with T1 .848 .007 
T5 with T2 .829 .009 
T5 with T3 .904 .005 
T5 with T4 .754 01 
T5 with T1 plus T2 .914 .004 
T5 with T1 plus T3 -960 .002 


*The Department of Psychology gave and scored the tests. The writer 
is indebted to the graduate students in psychology for many weary hours 
of statistical ealeulation. Special acknowledgment is due Professor Florence 
M. Teagarden for transmuting grades into sigma values and for giving the 
Binet Tests to the Freskmen girls. Mrs. Ida Nyce Root is to be thanked for 
many days spent in the vexatious task of statistical rechecking and of 
ferreting out errors. 
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The high correlation of T1 plus T3 with T5 suggests very 
strongly that T2 and T4 be eliminated in future testing. This 
would mean a great economy in the time required to give the test, 
in the time required to score, and in the cost of test materials. 


Correlation of T5 with the academic grades for the first semes- 
ter. In attempting to summate the academic grades the writer 
met with the usual difficulties. It seemed best after all factors 
were considered to resort to the tedious method of transmuting all 
grades to sigma values thus correcting for the great variance in 
distribution of A’s, B’s, C’s, D’s, and F’s. The sigma value was 
weighted according to the number of units of credit the course in 
question carried. The weighted sigma values were added and 
divided by the number of units of credit carried by the particular 
Freshman in question. This procedure overcomes the injustice 
usually experienced by students carrying “stiff” courses (i.e., 
courses with a high percentage of D’s and F’s and a low per- 
centage of A’s and B’s). The sigma values of A, B, C, D, and F 
were determined for each subject taught to Freshmen. D is the 
lowest passing grade while F is failure. 


The correlations between the academic grades and the Thorn- 
dike forms are as follows: 


P.E. 
with grades .42 .02 
with grades .33 .03 
with grades .36 .03 
with grades .415 .02 
with grades .51 .02 


The Thorndike total was now correlated with each college sub- 
ject taken by the Freshmen, debarring a few in which the number 


of beginning students was too small to admit of correlation. (Tab- 
ulation I) 
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The correlations are moderately high and considerable signifi- 
cance as N is large in the majority of cases and the group is 
homogeneous. The correlation of the Thorndike test with Human 
Progress is conspicuously high (r. .689) with a very small prob- 
able error (P. E. .01). This course deals with a large number 
of the arts and sciences in a semi-philosophical way. The classes 
are large and a premium is placed on general information and 
linguistic ability. English 1, Graphics 2, Mathematics 21 and 
French 1 correlated less highly than the other courses with the 
Thorndike total. English may be a matter of home conditions, 
foreign parentage or drill and follow up work through four years 
of high school. The students enter Graphics and Mathematics with 
very different prepsrations; French may be very easy for the stu- 
dent with a fair knowledge of Latin or one of the Latin languages. 
If Italian is spoken in the home it may prove to be a hinderance 
in an English course and a boon in a French course. One may 
speculate indefinitely as to the general causes for correlation or 
non-correlation. 


The prognostic value of the Thorndike College Entrance Tests. 


It is possible to have reasonably high correlations without much 
justification for applying the findings to an individual case. Dis- 
tinct sets of causes may be at work: thus, we may have nearly 
perfect correlation with a portion of the cases, a moderately posi- 
tive trend for a large median group, with a significant group show- 
ing a negative correlation. Again, we may have marked negative 
or positive correlation in certain regions of the data with an in- 
different correspondence in other regions. It becomes pertinent, 
then, to ask for’an analysis of the tendency to correlation or rank- 
correspondence as well as to compute the coefficient of correlation. 


The chief reason for administering the tests now under consid- 
eration to Freshmen is to predict their capabilities in connection 
with academic college requirements. It therefore seemed desirable 
to rate the Freshmen in some form of grade classification, ar- 
ranged in rank-order, and to compare this ranking with the Thorn- 
dike rank-orders. The conventional distribution of grades over a 
+8a normal distribution curve is, perhaps, as good as any. The 
data can then be tabulated as follows (A, B, C, D and F referring 
to grades and a, Sy and e referring to precisely the same per- 
cental divisions in the descending rank-order arrangement of the 
Thorndike totals) : 


Bors Cory Doréd Fore 
45 24 3.5 
No. cases........ 3 137 255 137 20 
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The college grades are expressed as sigma values and each Fresh- 
man’s rank-order is determined by finding his average sigma (the 
sum of his weighted sigmas divided. by the number of units car- 
ried). These average sigma values arranged in descending rank- 
order are then ready for comparison with the corresponding rank~- 
orders in the Thorndike tests. If the Thorndike test is prognostic 
of academic ability then the 20 students composing the A group 
should have a rank-correspondence such as to place the majority of 
them in the « group (the 20 ranking highest in the Thorndike 
group). Tabulation II presents the rank-orders for college grades 
and Thorndike scores for comparison and analysis of the rank- 
correspondence in different percental regions of the rank-order 
series. The question is: Does the Thorndike Test performance 
anticipate the quality or approximate quality of college work a 
particular Freshman will do? 


TABULATION II. 


B TOTALS 


| 

7 | | O 7 

| 

9 61 "5 6 

6! 33 5 

! 
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The following observations may be made directly from the tab- 
ulated data: 

1. No. a’s are ranked D or F in academic standing. 

2. No. es attain A or B standing. 

3. But of those making A standing, only 1/5 are a’s and the 
largest number are drawn from the y group. 

4. Had superior scholarship.'been predicted by selecting the 
a group we would actually have secured 4 A students along with 
9 B students and 7 C students. 

5. 80 of the 157 students comprising the A and B groups are 
in the « and § groups. 

Seventy-one of the 157 comprising the D and F groups are in 
the 5 and e groups. 

That is, the upper 27.5% of the Thorndike rank-orders would 
have been somewhat selective as regards academic attainment. 
Had the e group been taken as a basis of entrance cxclusion, it 
would have resulted in the excluding of 8 D students and 9 C 
students out of a total of 20 « students. 

6. Had the college wished to maintain very high standards and 
had excluded 6 and ‘e standings, it would have excluded 79 C 
students, 6 B students, and 1 A student out of a total of 157 
Freshmen. The Thorndike ranking would have correctly placed 
71 (that is 5-e’s who are in corresponding academic rank-orders, 
namely the D-F groups), while random choice alone would have 
correctly placed 44 of these. 


7. On the other hand, if the a-8-y groups had been used as a 
basis of initial retention in college, 86 D and F students would 
have been admitted into the Freshman class against the 71 D and 
F students excluded in the 5-e’s exclusion. Chance alone would 
have given us 114 D and F grade Freshmen, out of a total of 
410 a-f-y’s. 

Of the 569 students chance alone would have given 157 DF’s. 
Had the d5-e groups been arbitrarily excluded (and the a-f-) 
groups retained), 71 D-F students would have been excluded and 
86 D-F included. But 86 A-B-C students would have been thus 
excluded. This would give a total of 176 improper inclusions and 
improper exclusions or 19 more than the improper inclusions 
chance alone would have given. 

8. There are five comparable groups, a-A, 8-B, y-C, 8-D, and 
«-F. Of these groups, the y-C is the only one that is largest for 
the column in question. In this case it is totally without prog- 
nostic significance as chance alone would have placed 115 C’s in 
the y group. It should furthermore be noted that the distribution 
of C students through a, 8, 3, y and e is about the distribution 
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chance would have given. And likewise the row has a close-to- 
chance distribution through A, B, C, D and F. Hence for a stu- 
dent to be in the y group argues nothing as to his academic rank- 
order; and vice versa, C grades run the gamut of a near-chance 
distribution curve from a to e. 


Deducting from each of the five comparable groups (A-s, B-§, 
etc.) the number of cases that would fall in these groups by 
chance alone, gives the number of cases precisely located in a 
corresponding rank-order. We have: 


Of 20 a’s, 3 A’s or 15% in precise corresponding rank-order. 
Of 137 §’s, 27 B’s or 19% in precise corresponding rank-order. 
Of 255 y’s, none. 

Of 137 8’s, 20 or 15% in precise corresponding rank-order. 
Of 20 es, 2 or 10% in precise corresponding rank-order. 


9. Compiling 8, y and 5 we have 529 cases, comprising 128 B’s, 
246 C’s, and 129 D’s. Thus the Thorndike tests would have cor- 
rectly placed 12 more subjects than chance alone or about 2 per 
cent of the total number of students in the combined £8, y and 6 
groups. 


TABULATION III. 


| Scoresover 95 | 9/2 


3 | 5 0 0 

Seores8Sto95 2 24 7 4 

Scores 70to 85 20336) 6 25 92 45 23 12 

Scores 60to 70 143/25] O & 53 | 37 | 27 18 
Scores 50to60 | 2 18 27 | 34 14 

Scores under 50 1! 0 7 26 15 


AN ANALYSIS OF THORNDIKE’S GROUPINGS 


Thorndike attempts to prophecy on a basis of the absolute 
scores received. He makes the following classification: 

“A boy scoring over 95 is worth admitting in almost entire dis- 
regard of technical deficiencies. 

A boy scoring 85 to 95 has intellect enough to do collegiate 
and professional work with distinction. 

A boy scoring 70 to 85 has intellect enough to do the work te 

obtain a college degree. 
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A boy scoring 60 to 70 may be admitted if he is sufficiently 
in earnest and otherwise desirable. 

A boy scoring 50 to 60 should be admitted only if he is of 
extraordinary zeal or has_ suffered 
very great educational handicaps. 

A boy scoring under 50 should not be admitted.” 

The accompanying tabulation (III) follows Thorndike’s group- 
ing. The academic grade rank-orders have been arbitrarily 
classified to agree in rank correspondence with the Thorndike 
classification. 

Conclusions: 


1. None of the Thorndike A group falls in the corresponding 
academic group a (the 9 highest academic rank-orders); however, 
none falls in the lowest academic groups, e and f. 

2. Likewise none of the F group falls in the academic a or b 
groups. 


» 


3. The students composing the a group came chiefly from the 
C group. 

4. Taking Thorndike’s C group as the dividing line for in- 
clusion in or exclusion from college entrance, one would include 174 
correctly but also 93 incorrectly, (students making d, f, and e 
grades). On the other hand, the elimination of the D, E, and F 
students would have ‘excluded 93 students who are doing work 
in the a, b and c groups. Thus there would be 186 improper in- 
clusions and exclusions in using Thorndike’s A, B and C groups as 
an inclusion basis against 209 proper exclusions. Out of 569 
students 302 would be excluded. It must be recalled that in ex- 
cluding such a large percentage of the students seeking admission, 
chance alone would have correctly excluded 161. 

5. Excluding E’s and F’s would retain 85 of the upper 
Thorndike groups doing no better work than E’s and F’s, and 
would exclude 85 E’s and F’s doing: work on a par with those 
admitted. Granting that e and f grade work warrants exclusion, 
we have 74 proper exclusions, and 170 improper inclusions and 
unjust exclusions. 

6. Let us take Thorndike’s lowest group, F, (scores below 50), 
which Thorndike categorically asserts should be excluded. Of the 
63 comprising this group 48 do work on a par with many in the 
retained groups, while 48 in the retained groups are doing the 
same quality of academic work as those excluded. Thus we have 
96 improper exclusions and inclusions, and 15 proper exclusions. 

7. Tabulations II and III lead to the same general conclusions. 
While the correlation between grades and Thorndike scores is 
reasonably high, an inspection of the differences in rank-order 
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correspondence indicates that great injustice would be done, both 
in inclusions and exclusions, were the entrance tests used as a 
basis of college admission. In actual practice the lines of de- 
marcation between groups are too illy defined to warrant such 
drastic action as ‘off-hand exclusion from college. However, the 
trend of correlation is sufficiently strong to far surpass any im- 
pressionistic estimate and the tests should constitute an excellent 
basis for the initial analysis of cases, and for judicious use in 
attempting to size up Freshmen failures. Whether the mass test 
is more truly prophetic of final college success than the combined 
estimates of dozens of instructors after seventeen weeks contact is 
a speculative matter which can be decided only by follow-up work 
for a number of years. 


EXTREME UPPER AND LOWER RANK-ORDERS IN COLLEGE EN- 
TRANCE TESTS COMPARED AS TO PERFORMANCE 
IN BINET TESTS 
Forty of the upper rank-orders of the Thorndike Test (1to 58 
inclusive) and also 40 of the lower rank-orders (479 to 569 in- 
clusive)* were given the Binet Tests... The accompanying 
tabulation (III) shows clearly that the correspondence between the 


two tests is extremely close, quite in contrast to the vague cor- 
respondence between the Thorndike and the academic standing. 


*Inability to arrange a time suitable for both the student and the ex- 
aminer giving the Binet Test, made it necessary to go below rank 40 and 
above rank 529 in the lower group. One of the lower rank-orders was 
eliminated. The subject, an intelligent Italian just learning the language, 
refused to try the Binet Test after doing the vocabulary. 

‘The student appointments for the Binet Tests were arranged through 
the co-operation of Professor B. F. Ashe, Freshman Advisor. 
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TABULATION IV 


Binet Tests Used as a Basis for Comparing the Extreme Upper 
and Lower Rank Orders of Thorndike College 
Entrance Tests 


40 highest 39 lowest 
Rank Orders Rank Orders 
Chronological age range 22-8 to 16-5 27 to 18-2 
Chronological age median 20-1 18-5 
Mental age range 19-6 to 17-2 18 to 12-1 
Mental age median 19 15-5 
Vocabulary range 16,560 to 12,780 12,600 to 6,840 
Vocabulary median 10,260 
Number failing 
XVIII Superior adult vocabulary 39 
Binet’s paper-cutting test 25 
Repeats 8 digits . 24 
Repeats thought of passage.... 26 
Repeats 7 digits backward 31 
Ingenuity test 31 
Average adult vocabulary 
Interpretation of fables 15 
Difference between abstract 


Problem of the enclosed boxes. 
Repeats 6 digits backwards 


Vocabulary 50 words 
Induction test 
President and king 
Preblems of fact 
Arithmetical reasoning 


Vocabulary 40 words 
Abstract words 


Gives similarities 
x Vocabulary 30 words 
Absurdities 
60 words 
Total No. of tests failed 
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11 
11 

1 17 

23 

XIV 8 

8 
Cloek 
XII 1 

Ball and 4 
Dissected 3 
1 
Repeats 5 digits backwards...... 3 
4 Pictures interpretation............... 1 
1 
44 361 


THE FRESHMAN COLLEGE ENTRANCE TESTS 87 


The lower group is made up of older students and it may be 
added has more of foreign birth and foreign family. It is a 
matter of general observation that older persons show less as- 
sociative plasticity and speed. ' 


Concerning I. Q. and mental age: 


8 of the lower group had I. Q.’s 100 or more. 
31 of the lower group had I. Q.’s less than 100. 
3 of the lower group had mental ages of 12(11-7 to 12-6) 
5 of the lower group had mental ages of 13(12-7 to 13-6) 
4 of the lower group had mental ages of 14(13-7 to 14-6) 
8 of the lower group had mental ages of 15(14-7 to 15-6) 
12 of the lower group had mental ages of 16(15-7 to 16-6) 
4 of the lower group had mental ages of 17(16-7 to 17-6) 
2 of the lower group had mental ages of 18(17-7 to 18-6) 


In the upper group the lowest I. Q. was 104. Nineteen failed 
in the high adult digits tests; 7 failed in both, 14 passed all of 
the tests; 13 failed 1 test; 11 failed 2 tests; 1 failed 3 tests; and 
1 failed 5 tests. 


The fact that 25 of the 44 failures in the upper group were digit 
tests indicates that the digit tests (counting an entire year of 
mental age) are entirely out of proportion to their importance in 
determining mental age. A careful study of the individual cases 
gives little evidence that immediate verbal auditory memory span 
for digits is of any marked value in determining superior intelli- 
gence. It is a matter in which there is marked individual dif- 
ference in adults with slight reason for believing that there is a 
corresponding intellectual difference. 


Reading and: Vocabulary. The most striking contrasts be- 
tween the groups is in the vocabularies and reading habits. The 
highest vocabulary in the low group is less than the poorest vocab- 
ulary of the high group. 


An attempt was made to get at the reading habits of the 
groups under consideration. A rough reading scale was impro- 
vised, scoring as follows: Pan 


.. Readers of critical and difficult literature. 
Superior readers, read and wood current 
3. Average readers. 
4. Readers of newspapers and ert news. 


5. Readers who do almost’ no’ voluntary reading being’ nearly 
devoid of a settind habit. 
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Tabulating for the two groups we have: 
Reading Number in ‘Number in 
High Group ; Low Group 


The contrast here is very marked indicating decided lack of 
reading habits in the lower group. 


TABULATION V. 


Cases With Marked Inconsistency in Performance in Upper and 
Lower Groups 


eading 


ank 


REMARKS 


Thorndike 
‘Test Rank 
Academic 
Age Binet 


Rank 
Mental 


oR 


Out of school 3 yrs. Trig. & Physics 
difficult 


rw 


19 
18% Out of school 3 yrs. Indifferent to 
grades 


19% Exemption grades in H. S. 

19 

19% 

19 

19 

19 Absent one month in first semester 
19% 

18% 

18% 

17-2 

13-6 Italian. Foreign parent. 

16-5 Jewish. Soho district. 

15-8 Jewish. Foreign parent. 

17-1 Attended sixteen. different schools. 
16 


14-9 Italian. Foreign home. 
13-11 Jewish. 
17-7 Jewish.-~- 

12-2 Negro. ~++ 

18 Commutes four hours daily. 
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6 137 92 ha 
10 197% 83 1- 
13.5 188% 81.5 
20 297 87 ca 
27% 223% 79 th 
30 197 90% 
385% 554 88 of 
35% 301% 84 Ss} 
: 40% 542 81 vi 
350% 73 Ww 
58% 258 89.5 
522 283 58 
524 71 56 p 
526 172 67 fi 
506% 290% 70 
538% 249 62 a 
545% 267% 40 g 
553% 264 63 is 
554 208% 70 c 
557 157% 48 ¢ 
561 206 68 0 
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Analysis of cases showing marked contrasts in performance. 
It would have been highly desirable to have studied the social 
conditions surrounding these Freshmen composing the upper and 
lower groups. This was impossible, due both to lack of time 
and to the diffidence on the part of students concerning their 
social and economic status. Nevertheless, after personal inter- 
views with the students in question, it is possible to throw some 
light on the marked inconsistencies in test performance and 
academic rank, presented in the accompanying Tabulation V. 


CONCLUSIONS 

1. A high Thorndike rank is accompanied by a high Binet 
score and large vocabulary; and vice versa, a low Thorndike score 
by low mental age and small vocabulary. 

2. But high vocabularies are not necessarily found with a 
superior reading ranking or vice versa. 

3. Of the 40 in the upper Thorndike group, 14 are distinctly 
below any corresponding academic rank-order, while 7 fall below 
the median rank-order, 285, of the entire Freshman class. 

4. In the lower group just the opposite is found; with low 
mental age, low vocabulary and low entrance examination score is 
found a relatively high academic rank-order. Of the 10 cases, 9 
have academic ranks above the median rank-order of the Thorn- 
dike scores. 


5. The cases showing marked inconsistencies constitute nearly 
1-3 of the total number of the 79 cases studied intensively. 

6. It should also be noted that in spite of exceedingly low vo- 
cabulary and mental age, cases 522, 553%, and 557 make more 
than median grades. 


= 


7. In the lower group, 6 are either Italian or Jewish. In all 
of these cases, and many more not presented here, the parents 
speak a foreign language. Is it not to be expected that an English 
vocabulary and tests resting on the niceties of. English idiom 
will suffer? 

8. Absence during the semester and a several-year gap be- 
tween high school and college seem to be the most obvious and 
plausible explanations in certain cases of relative academic in- 
feriority in the upper group. 

9. The negro, case 557, unlike the Jewish and Italian cises, 
appears very dull. He is carrying, however, a very heavy pro- 
gram including mathematics, physics and chemistry in which he 
is making more than median grades. An examination of the per- 
centage of A’s, B’s, C’s, D’s and F’s given in these courses indi- 
cates that they are not “pipe courses,” (see Tabulation I). In the 
opinion of the writer the explanation is this:. The student has 
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had an excellent high school training with a careful drilling in 
technical courses that dovetail into the courses taken in college. 
It is doubtful if the chap will long maintain an academic ranking 
of 157% against a Thorndike rank of 557 and a Binet mental 
age of 12-2. 

10. The personal interviews elicited the following, at least 
plausible, explanations for the marked rank-order difference be- 
tween academic grades and the Thorndike tests: 

(a) Some students are capable but indifferent to grades, tend- 
ing to follow an interest rather than make punctilious preparation 
of daily tasks. Case 8 is a good example. In the opinion of the 
writer, the Thorndike test places him more nearly correctly. The 
chap has been out of school three years between high school and 
college, which is perhaps a contributing factor, but the chief 
difficulty is that he has very positive ideas concerning academic 
values and refuses to fulfill, what he considers to be, petty class- 
room demands. He has already been successful in the work-a-day 
world. 

(b) Outside work, undoubtedly, in ‘many cases makes for 
poorer grades. 

(c) Commuting tends, also, to produce a poorer, grade of work 
than otherwise. 

(d) The character of the high school work plays a large part 
in the character of the Freshman college work. For example, one 
high school, gives to its seniors a course on Current Literature. 
All of these students show a relatively greater general informa- 
tion than the average high school graduate. Another high school 
apparently gives unusual work in science and mathematics. One 
of the students from the lower Thorndike group, with good grades, 
stated that the physics and mathematics he was taking were 
almost entirely review. Again, a student changing from a literary 
or a scientific course in high school to a scientific or literary in 
college is greatly handicapped over the one who has had a con- 
stant vocational objective. All considered, students who have 
established good habits in mathematics, science, laboratory and 
English composition, regardless of their native endowment, have 
an advantage over those who have not. 

(e) Unusual zeal for learning and persistence mark the at- 
titude of many of the foreigners, éspecially the Jews. The greater 
the economic and cultural handicap, seemingly, the greater the 
zeal. 

(f) One might elaborate indefinitely, but this is sufficient to 
indicate that a careful study of the individual case is the next 
step after the rough grouping of a mass intelligence test. It is 
much easier, and far more satisfying, to deal in broad statistical 
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trends than it is to tease out the vexatious problems of individual 
causes in variation. But, the latter is the next logical step in 
testing. 

11. Errors in grading undoubtedly play a part in the rank- 
order differences between the Thorndike rating and the instructors 
rating. Impressionistic methods of grading, marked bias in 
weighting certain qualities in work and too few facts for estimate 
often cause the instructor’s grade to be a long way from the 
absolute facts. 


CONCLUSIONS 


1. While there is a fairly high correlation between academic 
grades and the Thorndike tests, upon analysis the approximation 
is found to be too rough to permit exclusion, inclusion, or the 
assumption that the highest test group (above 95) is peculiarly 
superior. 

2. Nevertheless, the college entrance tests are not without 
merit, offering as they do, an initial point for analysis and 
judicious prognosis, but unfortunately, not scientific prophecy. 
Such tests are undoubtedly superior to any impressionistic esti- 
mates of the Freshman. It becomes necessary, however, to fol- 
low up the test findings with a careful analysis of the qualitative 
character of the score; to supplement with a common-sense in- 
vestigation of the nature of the high school training noting how 
well it dovetails into the curriculum of the particular student in 
question. The home, language, zeal, outside employment, and so 
on must be weighed and evaluated. The tests form but a part of 
a complex differential diagnosis. All factors must be carefully 
summated analogous to a court’s getting a perspective on cir- 
cumstantial evidence. 

3. Whether the student is surpassing or falling below the 
Thorndike rank-order, he is equally of interest. At present such 
cases demand an investigation to “show cause.” 

4. Not only is the Freshman up for critical study and evalua- 
tion but the test purporting to evaluate him is equally on trial to 
show it has value. 

5. A combination of Part I, form 5, and Part III, form D would 
probably be as valuable as all four forms; thus enacting a saving 
of 50 per cent in giving, scoring, and purchase price of blanks. 
But it should be added that for purposes of analysis, much is to 


be gleaned by a study of the qualitative character of all four 
forms. 
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6. The rank-order comparison between the Thorndike tests and 
the academic grades should be continued throughout the college 
course. At present any assumption as to the superior prophetic 
value of the mass intelligence test over the Freshman’s academic 
performance, is speculative and idle. 
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THE UNRELIABILITY OF M. A. AND I. Q. BASED 
ON GROUP TESTS OF GENERAL ; 
MENTAL ABILITY.’ 


By ARTHUR I. GATES 
Teachers College, Columbia University 


The practice of computing mental ages and _ Intelligence 
Quotients from scores of group tests of general mental ability has 
been questioned recently on theoretical grounds, and various 
workers are discovering the unreliability of these measures by 
comparing the results obtained from different tests.° This paper 
presents data which show that at least under certain conditions, 
quite ridiculous IQ’s may be derived from certain group tests. 

The following tests were given on successive days to the pupils 
of Grades I to VIII inclusive of the Scarborough School: Dear- 
born, Examination I or II, Haggerty, Delta I or II, Holleys Pic- 
ture Completion or Sentence Completion, Illinois (III to VIII 
only), Otts Primary or Advanced, National Intelligence, Forms A 
and B, (III to VIII only) and Meyers Mental Measure. The 
Stanford-Binet was given during the year to Grades I to VI 
inclusive. 

The tests were given to groups of twenty or less and particular 
attention was given to time, instructions, provision with pencils, 
etc. The pupils had learned to take tests earnestly and we are 
confident that the test conditions in general were more uniform 
and satisfactory than will be obtained in ordinary school practice. 
The results were scored by an expert. 

The mental age equivalents of the scores on the several tests 
were computed by use of the age norm tables accompanying each 
test... In most cases the line describing the median age score was 
extended to cover such of our scores as exceeded the highest age 
norm given. From these data, IQ’s were computed. 

Table I, which gives the data in detail for Grade I, illustrates 
the treatment of results. The horizontal lines give the facts for the 
‘veral subjects; the vertical columns give the facts for the class. 
lumns 1 to 6 give the IQ’s which are summarized at the foot in 
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the form, (1) the highest IQ in the class (2) the lowest, (3) the 
mean and (4) the mean deviation from the mean. Column 7 gives 
the mean IQ for each subject; column 8 gives the mean deviation 
of the subject’s various IQ’s from his mean and column 9 gives the 
spread i. e., the subject’s highest IQ minus his lowest IQ. These 
data are summarized at the foot of each column. 

The facts of Table I may be considered with reference to certain 
important school problems. 

1. Is the class, on the whole, composed of bright or average 
pupils? A very general answer — bright — may be given, but 
the degree of brightness varies widely according to different 
tests. The mean IQ’s range from 109 (Dearborn Examination) to 
129 (Myers). According to the one the group should do slightly 
better than average work, according to the other, very much bet- 
ter. According to the one the average pupil could scarcely com- 
plete a college course, according to the other he could-do so with 
ease. Assuming the average chronological age to be six years 
according to one test the mental age would be 6.5 years corre- 
sponding to first grade ability, but according to the other test the 
mental age would be 7.8 years corresponding to advanced Grade 
II work. 

2. Are the pupils of uniform or diverse brightness? The 
figures for the lowest IQ are fairly uniform, ranging from 92 to 
99 but those for the highest are diverse, ranging from 121 to 165. 
The mean deviation of IQ’s from the average are for the Dearborn 
6.0 and for the Myers 12.3, with figures for the other tests scat- 
tering between these extremes. Recommendations concerning sec- 
tioning for purposes of promotion would depend very greatly on 
what test is used. 

3. What is the IQ of a particular pupil? Pupil A is. given 
IQ’s ranging from 104 to 165, that is, by one test he would be 
classified as “average” by another a “genius.” Pupil A _ has 
various IQ’s ranging from 104 to 144. Column 9 gives the differ- 
ence in points of IQ between the highest and lowest. The largest 
difference is 61, the smallest 15, the mean 32. On the average a 
pupil ranges from “average” to “very superior” intelligence de- 
pending on the test. If we take as measures of variability the 
mean deviation from the average IQ instead of the whole range 
the figures given in column 9 are of course smaller the greatest 
being 17.0 the smallest 3.8, the mean 9.3. That is if the average 
IQ is 118 in the average case half of the IQ’s obtained from the 
several tests will fall between the limits 108.7 and 127.3. Further 
statistical treatment is unnecessary. It is obvious that the varia- 
bility of the IQ’s obtained by several tests all given within a few 
days is so great as to make prediction concerning optimum rate 
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TABLE I 


SHOWING I .Q.’s ORTAINED BY PUPILS OF GRADE I IN SEVERAL TESTS 
1 2 3 4 5 6 7 8 9 


J 96. 99 117 104 107 152 113 148 # «56 
107 99 3.8 
L 120. 115 131 127 114 136 «124 7.5 22 
M108. 108 125 8 100 99 123 111 9.2 26 
N 110. mea 226 7.5 25 
106 17.0 
117.5 117.5 7.7 
Highest135. 121 144 1384 125 165 126 17.0 61 

Lowest 92.5 92 98 98 99 99 100 3.8 15 

Mean 113. 109 129 114 113 «129 118 9.2 32 

M.D. 10.1 6.0 7.6 8.9 68 12.3 5.4 2.2 9.6 


SUMMARY OF RESULTS FOR GRADE II n=19. 
Highest141 147 148 155 150 196 150 = 17.5 
Lowest 103 97 122 108 92 105 114 5.3 21 
Mean 117 116 1382 135 124 #41385 «#4127 «10.0 35 
M. D. 7.4 7.3 6.9 124 12.6 15.4 76 28 
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of promotion, possibility of completing college, ete. quite out of 
the question. 


RESULTS FOR GRADE II 


The results for Grade II are summarized in the lowest section of 
Table I. For this grade, the group tests, all of them non-verbal, 
are the same as those used in Grade I. 

The results are similar. The IQ’s yielded by the Myers test 
show an average deviation from the mean which is more than 
twice that given by the Otis. One individual receives an IQ on one 
test which is 76 points higher than that given by another test; the 
difference between an average intelligence and idiocy. The 
average range of *Q for an individual is 35 points. 


RESULTS FOR GRADES III to VIII 


Table II gives summaries of the data for grades III to VIII. 
This Table shows the ridiculous results which teachers and other 
workers are likely to obtain from efforts to derive M. A.’s and 
IQ’s from group tests in these grades. For example, one fairly 
bright boy in Grade VII earns a mental age of 46 years on one 
test giving an IQ of 408. If they should take the results of 
certain tests seriously there are many pupils who could think 
rather pitifully of Sir Francis Galton whose IQ is estimated by 
Terman to be only about 200. According to the Myers test the 
average IQ’s of our Grades VII and VIII is 261.° Picture the per- 
plexity of the teacher test enthusiast who finds that pupil K in 
Grade V earns an IQ of 99 on the National test and 366 on the 
Myers. Many others are nearly as baffling. 

The variability of scores obtained by a given individual be- 
comes yreater as the grade becomes higher, as may be seen in 
Table III which gives the range of IQ’s and M. D.’s for the aver- 
age individual in each grade. 

These facts should not be interpreted as a sweeping criticism 
of the usefulness and validity of group tests of general mental 
ability. What is urged is a restraint of certain inappropriate 
uses of these instruments. We should not, it is clear, compute 
M. A.’s and IQ’s from raw scores, but these raw scores them- 
selves may be put to profitable use. It has been shown elsewhere that 
these group tests at least those very verbal in content give sub- 
stantial correlations with achievement in school work." 


5I1Q's could not be computed for National and Otis scores for most of 
our Grade VII and VIII pupils because they earned scores considerably 
higher than those represented by the line when it becomes horizontal 

A. Ll. Gates, Correlations of Intelligence tests with achievement in schoo! 


subjects. Journal of Educational Psychology, March, April and May, 1922 
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TABLE II 


SHOWING HIGHEST, LOWEST, MEAN AND M. D. OF I. Q’S FOR THE 


GRADE 
1 AND 2, 


ACCORDING TO THE SEVERAL TESTS. 
DIFFERENT INDIVIDUALS MAY 


BE 


REPRESENTED BY EACH FIGURE 


6 #8 8 ae WE 

Highest 1Q.. .....:00«0+- 133 127 133 135 147 171 154 166 143 16.8 

Lowest 95 94 90106 100 93 80 90 98 4.6 

Month 112 112 109 120 120 120 116 115 116 8&8 

M. D. from Mean 8.9 7.2 8.3 5.9 10.2 20.015.316.2 8.7 2.2 
GRADE IV 

Fig hetlt ccscnsiicsunsescch 140 133 142 399 166 176 162 174 154 61.0 

99 100 104 93 113 104 100 104 113 6.0 

MOON icudnitomeenaes 120 113 124 168 135 128 126 136 132 17.2 

7.1 6.1 5.5 61.412.4 11.0 10.6 14.0 10.8 10.2 
GRADE V 

139 151 136 :°169 151 324 156 147 154 55.4 

Lowel -cccmcieentin 100 100 98 115 113 81 109 112 110 5.1 

Mon a ccuahennean 116 113 120 139 134 169 132 133 132 19.4 

M. 9.5 7.2 10.0 13.5 9.2 7913.9 7.3 12.3 11.7 
GRADE VI 

160 141 170 162 164 351 191 152 180 49.3: 

Lowel 83 88 80 92 95 70 92 97 90 48 

Mea <isedceeeend 120 115 123 121 130 179 130 125 132 19.3 

M. D.. ool 18.8 12.9 20.2 15.9 15.5 75.6 20.6 13.0 21.8 11.9 
GRADE VII 

Higham ccd Not * 183 * 155 408 181 147 210 79.0 

Lowel: Given 109 110 74 94 104 103 7.8 

Mest ccncapodccciies 137 132 261 138 127 159 43.9 

M. 14.2 18.0 96.217.3 9.2 19.6 24.4 
GRADE VIII 

Highest ...rcdescssans Not 148 * 1388 404 168 141 186 86.8 

Lowedl Given 105 91 88 85 96172 5.8 

Mean 124 116 261 124 111 147 46.9 

M. 9.0 8.2 56.114.2 8.9 13.7 16.9 


GRADE 
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Range 
Minus Lowest 
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286 
20 
157 
54 


*“Seores for nearly all pupils above the equivalent of highest 


ages given. 
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TABLE III 


Range of IQ in the Mean Deviation from 
Several Tests for the Mean IQ for the Aver- 


Average Individual age Individual 
73 172 


THE CAUSES OF THE UNRELIABILITY OF I1Q’S BASED 
ON GROUP TESTS 


An effort has been made to ascertain the causes of the extreme 
variations in IQ given to the same individual by different tests 
although we probably have been unable to discover them all or to 
evaluate accurately those that seem likely. 

First, the distribution of native mental ability in the groups 
upon which the tests were standardized has probably been un- 
equal. That this is often the case was clearly illustrated in con- 
structing norms for the National Intelligence tests.’ Such dif- 
ferences are not necessarily eliminated by mere accumulation of 
vast numbers of measures; a fact that is now quite generally 
recognized. It is probably an important source of inequality of 
the test norms.” 

Secondly, the character of school training, especially in read- 
ing, writing and arithmetic, affects the results considerably. For 
example, if certain grades were backward in arithmetic in com- 
parison with pupils upon whom norms were established, they 
would obtain lower IQ’s in those tests which contained relatively 
large amounts of arithmetical functions other things being equal. 


Again results will differ depending on the relative amounts of 
non-verbal material which the particular tests contain. Two in- 
vestigations have shown that in Grades III to VIII the correla- 
tions between verbal and non-verbal tests are low; sometimes 
negative in grade groups. In our results the association between 


7See L. M. Terman and E. D. Whitmire. Age and Grade norms for the 
National Intelligence Tests. Journal of Educational Research. February, 
1921, and G. M. Whipple, The National Intelligence Tests. Journal of 
Educational Research, June, 1921. 

‘J. P. Herring. Verbal and abstract elements in intelligence examina 
tions. Journal Educational Psychology. December, 1921, and A. I. Gates, 
same journal, March, April and May, 1922. 


ve 
he 
| 
t} 
| Is 
| st 
| | 
tl 
t] 
| a 
‘ 
a 
T 
d 
i 
2 t 
f 
t 


UNRELIABILITY OF M. A. AND I. Q. GROUP TESTS 99 


verbal and non-verbal tests becomes less close as the subjects 
become older. This fact is a partial explanation of the fact that 
the disagreement between IQ’s is greater in the upper grades. 

The factor of greatest importance in our results is practice in 
group tests. Table II shows that Scarborough ptpils earned IQ’s 
that are obviously too high. On the whole, the more practice 
preceding a particular test, the higher the IQ’s obtained. There 
is no doubt that practice results in very rapid improvement in 
such tests... There are three ways in which practice will cause 
variations in IQ from test to test: First, scores will advance con- 
siderably by virtue of general adaptation to test conditions; 
secondly, the gain will be relatively great in the particular tests 
which contain large amounts of identical or equivalent exercises; 
thirdly, the norms for the tests will be high or low, depending on 
previous practice in intelligence tests as well as on general mental 
abilities of the subjects on whom the norms were based. Do 
those who construct norms for their tests actually ascertain the 
amount, if any, of previous experience in tests their subjects have 
had? Without a doubt norms established on uninitiated pupils 
will differ considerably, native ability being equal, from those 
secured from pupils who have been tested extensively. 

Again, the cumulative procedure of establishing norms prob- 
ably tends in the long run to produce norms that are too low. 
Testers probably succeed in controlling the time better than other 
conditions. In the long run numbers of students fail entirely to 
understand methods of procedure, skip pages, lose time through 
distraction, broken pencils, etc., etc., whereas the factors conspir- 
ing for higher. scores are fewer. Scarborough pupils were 
handled in small groups. They were provided with several 
pencils. They had learned to do their best in all “tests.” Dis- 
turbances and misunderstandings very seldom occured. These 
factors would tend to produce differential success depending on 
whether a particular test contained more numerous speed or power 
exercises. Well practiced pupils would probably have a con- 
siderable advantage over the inexperienced particularly in the 
former, because they make a quicker start and maintain a steadier 
performance. 


Finally, most of the Scarborough pupils are of superior in- 
telligence as evidenced by the Stanford-Binet 1Q’s (which aver- 
age about 117°), by achievement in school subjects and by informa- 


‘For example, see Knight Dunlap and Agnes Synder. Practice Effects in 
Intelligence Tests. Journal Experimental Psychology, 1920, p. 357-377. 

This statement is made because most of the Stanford tests were given 
before the experience in group tests and because, on retesting this year, it 
is found that practice in group and other tests has not caused any marked 
increase in 1Q. 
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tion concerning the social status of parents. It is possible that 
very bright children with a little practice may exceed average 
children in these tests by a larger amount than the IQ ratios 
would suggest. In support of this view we find that some of the 
very bright pupils very greatly exceed the norms for the grades 
to which their mental ages correspond in speed of reading. In 
depth or power of comprehension (Thorndike-McCall) however, 
their excess over the average, while marked, is not nearly so 
great. This would make for differential success depending on the 
relative amount to which achievement in each of the several tests 
depends upon these abilities. 

It will not be profitable to speculate further; what we need is 
experimentation. These experiences have been sufficient to make 
one very cautious in the interpretation of group test scores and 
particularly cautious in the use of the M. A. or I. Q. 

The group tests are chiefly useful for purposes of rough 
classification and for very rough prediction but either should be 
undertaken only after inquiry into séveral factors. First, has this 
pupil ever taken such tests before? If so, which, when, and how 
many? We need data badly to enable us to make a rough correc- 
tion in such circumstances. Secondly, what has been the subject’s 
training in school subjects? It was found, for example, that some 
of our Grade III pupils made enormous improvement in most 
verbal group tests as the result of having improved in reading 
during the interval between two tests. We need information con- 
cerning the transfer of improvement in all school subjects to 
achievement in group tests. Finally did the subject lose any time 
for any reason during the test? This matter assumes great im- 
portance in some of the group tests. If all of the pupils have had 
no (or the same amount of) previous practice in group intelli- 
gence tests if their school training has been the same and if no 
accidents occur the results, at least those of verbal tests, are 
very useful but coarse instruments for classification. 

Perhaps a change from the term “intelligence tests” to some 
such term as “classification tests’ would have a favorable effect 
in reducing misunderstanding by teachers and laymen and in 
curbing misuse by over-zealous testers. It has been well said 
that the work of measurement “may soon need protection from 
over-extravagant hopes more than from hostile criticism.’ 


3B. LL Thorndike. Measurement in Education. The Twenty-first Year 
book of the National Society Study of Education. Part I, p. 8 1922. 
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NEWS AND COMMENT 


The Bureau of Vocational Guidance of-Harvard University has 
taken over the publication of the National Vocational Guidance 
Association Bulletin. It is their purpose to enlarge and improve 
this Bulletin as support becomes assured for it. , 

The Journal extends a cordial welcome to this new magazine in 
the field of vocational guidance. Professor Frederick J. Allen 
will serve as Editor. 


A NOTE ON POPULAR PSEUDO-PSYCHOLOGICAL 
BELIEFS 


Kornhauser and Jackson present data in a recent issue of this 
journal’ showing that actual business men do not use quack systems 
of judging character in their employment work. The results ob- 
tained by the use of their questionnaire are rather surprising in 
view of the apparently successful exploitation of “applied psy- 
chology” by various charlatans who invade our larger cities, ad- 
vertise extensively in the newspapers and on the bill boards, give 
free lectures in the best hotels or theatres and then conduct large 
classes for four or six weeks at so much per head. Dr. Joseph 
Jastrow at Wisconsin has made a special study of these quacks 
and the writer believes as the result of conversations with Dr. 
Jastrow plus his own observations in the quack-infected North- 
west that the merits of psychological research and applications 
are imperiled by this wide spread exploitation of pseudo- 
psychology. 

Pseudo-phychological beliefs amounting in some cases to super- 
stitions seem to be more common than most of us realize. The 
writer in conducting a summer school course in Applied Psy- 
chology was amazed to discover the range of interests expressed 
by these adult summer school students, but even more so, by the 
beliefs shown in answer to a questionnaire. The range of inter- 
ests included a desire to study the problem of learning, memory, 
how to study, business psychology, salesmanship, influencing 
people, personnel work, personality, vocational guidance, ab- 
normal psychology, psycho-analysis, spiritualism and character 
analysis. Below is given a summary of the beliefs expressed by 
these students at the beginning of the course in answer to a 
questionnaire. 

The above table speaks for itself. If mature summer school 
students exhibit such a variety of beliefs what must the average 
business man believe? It goes without saying that such a course 


A note for the extent to which systems of character analysis are used in 
the Business World."" Jour. Applied Psy., Vol. VI, No. 3, Sept., 1922, p. 302. 
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NEWS AND COMMENT 


SUMMARY OF BELIEFS EXPRESSED BY APPLIED PSYCHOLOGY 


STUDENTS 


Positive Positive Unknown or 


Do you believe in: Belief 
2 
6. Memory Systems 12 
1 

5 
9 
138. Absent Treatment. 1 
14. Gifted Children are “Peculiar” 12 


Psychology Can Determine 
What Specific Job a Man Is 
16 


Disbelief 
11 


_ 
ow © 


Uncertain 


— 


0 


needs devote some time and attention to evaluating the evidence 
for and against a large number of pseudo-psychological systems 
following this up with a presentation of what applied psychology 


can do and can not do. 


Thus it turns out that phychological in- 


struction must serve as a sobering survey of the limitations of 


psychological applications. 


It may be that business men are free 


from such false beliefs yet the undersigned classifies himself as a 
“doubting Thomas.” 
University of Minnesota 


DONALD G. PATTERSON. 
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BOOK REVIEWS 


CLAPAREDE, EDOUARD. Geneva, Switzerland, Problems and 
Methods of Vocational Guidance. International Labor Office, 1922. 
Price, 40 cents. 


Interest in vocational guidance has been growing so rapidly 
that the International Labor Office has taken cognizance of it as 
a subject of international importance. About two years ago the 
Office appointed the University of Geneva as its agent to collect 
information regarding the progress of the movement throughout 
the world, and to prepare news items for regular publication in the 
International Labor Review. This work has been carried on in 
increasing volume and with widening usefulness. In the meantime 
Professor Claparede has been conceiving and directing investiga- 
tions fundamental to vocational guidance from his laboratory and 
from the Institute J. J. Rousseau, and he has brought out this 
brochure as a precipitation of the results of his study. 


It is not a book. It is rather a survey that just escapes the 
severities of outline presentation. Furthermore it does not treat 
all phases of vocational guidance. Though it recognizes the im- 
portance of the social, economic, physiological and educational 
aspect it treats specifically only the psychological. This it en- 
visages with great clearness. Particularly sane and proportion- 
ate is the criticai discussion of the difficulties involved in the 
application of tests in practical vocational situations. In his 
summarized conclusions Professor Claparede gives what might 
be regarded as an international message: 


“In conclusion it may be well to summarize very briefly the 
steps which should be taken to place vocational guidance on a 
firm foundation. 


1. Psychological analysis of occupations and their classification 
according to the abilities required. 

2. Determination of corresponding abilities in individuals and, 
as a preliminary, the testing of tests to determine their diag- 
nostic, and even more their prognostic, value. 


3. Investigation of the process of training and educability. 


4. Determination of the importance of general intelligence in 
the various occupations. 


5. It should never be forgotten that a forecast of ability will 
never be a certainty, but only a probability. Statistics should be 
prepared to determine this probability. 

6. Efforts should be made to arrive at a uniform system of 
measurement so that psychographs or psychological diagrams 
may be comparable between different countries and between dif- 
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ferent vocational guidance offices. Grading by percentiles appears 
to be the most rational and least ambiguous method. 

It is in the social interests of the community that every man 
should be in the position for which he is best fitted. It is also 
desirable and increasingly in accordance with the spirit of modern 
democracy that the various social functions should be allotted ac- 
cording to merit and not in virtue of any privilege. The only 
means of attaining this ideal is by a sound system of vocational 
guidance. All those who have at heart the harmonious organiza- 
tion of society, both employers’ and workers’ organizations, should 
therefore facilitate by all means in their power any investigation 
having for its object the establishment of industrial psychology 
upon a firmer foundation.” 

Every American phychologist who is touching ever so lightly, 
either in his teaching or in his investigation, upon vocational prob- 
lems, should ponder on this brochure. He will find posed for him 
numerous problems for the experimental investigations so sorely 
needed, and he will find references to works of European investi- 
gators that are too little known in America. 

Indiana University H. D. KITSON. 


Prescott, F. C. The Poetic Mind. New York: The Macmillan 
Company, 1922. Pp. 308, $2.00. 


There are two classes of people to whom this book should be of 
special interest. The first are the psychologists. Professor Pres- 
cott has for the object of his study the working of the poet’s mind. 
His evidence is from the most reliable source—namely, from the 
poets themselves. An English professor, he has collected from far 
and wide statements of the poets and other creative artists con- 
cerning the source and manner of their inspiration; and with a 
surprising unanimity of opinion he has found that they testify to 
a power within from which their creative works emanate and over 
which they have no voluntary control. This mysterious source of 
power Professor Prescott attempts to identify with the subcon- 
scious mind. He seeks to show further, always depending for his 
chief evidence on the poets themselves, that the same principles 
which apply to dreams according to Freud apply also to poems, 
which are emanations of the subconscious mind just as much as 
are nocturnal dreams. Poems, like dreams, have their origin in 
thwarted desires. Step by step Professor Prescott parallels the 
two kinds of dreams, showing that the impulse, the censorship, 
displacements, condensations, fusions, and fictional gratification 
are common to both and have the same explanations. Whatever 
be the merits of his findings, he has decidedly done the psycholog- 
ists a service by bringing them into contact with a field of investi- 
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gation which few of them are likely to be so well qualified as he 
to explore—the mental operations during poetic inspiration as 
explained by the poets themselves. 

The book, however, should perhaps have even a greater appeal 
to the critic and the student of literature. In the first place, even 
if there is not a particle of truth in his thesis, it is a fascinating 
romance of the subterranean caverns of the human mind. In the 
second place, if his thesis is partly true, his work becomes valua- 
ble and suggestive. It sheds new light on such ancient themes 
as the relation of beauty to truth, the catharsis of tragedy, the 
universality of the greatest poetry, symbols and figures, and even 
puns. In the third place, it is very possible that his thesis is true. 
Once grant the existence of a subconscious mind of the kind ac- 
cepted by many modern phychologists and the rest easily follows. 
The subconscious mind is old and wise, but not accessible to the 
average individua!. The poet is one who for comparatively short 
intervals does have access to it, though through no volition of his 
own. What have been regarded as his vague and misty dreams 
are in reality flashes of truth from a deeper mind, which is racial, 
universal, unhampered by the limitations of time and space and 
immediate needs. Poetry, therefore, represents the highest wis- 
dom of the race. Scientists plod along, and by the aid of the 
conscious mind finally confirm what the poet saw in his vision. 
Shelley, without the aid of modern psychology, states the idea ex- 
actly in his Defense of Poetry: “In the infancy of the world, 
neither poets themselves nor their‘auditors are fully aware of the 
excellency of poetry: for it acts in a divine and unapprehended 
manner, beyond and above consciousness, and it is reserved for 
future generations to contemplate and measure the mighty cause 
and effect in all the strength and splendour of their union.” 

It seems to me, however, that there is at least one rather serious 
inconsistency in the book. In Chaper VIII Professor Prescott is 
inclined to accept the conclusion of Wordsworth that emotions recol- 
lected in tranquility are “generally” the source of poetry. In 
Chapter XV he attempts to show that poetry is the result of 
thwarted desires: that, as the poet may not have the object of his 
desire, he seeks relief, even though it be a veiled relief, in a fic- 
tional gratification, as Keats, let us say, in The Eve of Saint Agnes. 
Such a relief is obviously needed and obviously would come when 
the emotion is powerful and not when it is recollected in tran- 
quility. Likewise in the last chapter Professor Prescott has no 
basis for even suggesting the possibility of poets for seeing spe- 
cific events in a literal sense, as Shelley foresaw his own death 
by drowning at sea. 

Ohio University. B. L. JEFFERSON. 
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TERRY, PAUL WASHINGTON. How Numerals Are Read. Chicago, 
The University of Chicago, Supplementary Educational Mono- 
graphs, No. 18, 1922. Pp. XIV+109. Price, $1.00. 


Part I of Doctor Terry’s excellent monograph deals with studies 
of the reading of numerals by introspective methods. Part II 
contains an account of the use of photographic apparatus in deal- 
ing with this problem. 

It is the opinion of the reviewer that the chief value of the mono- 
graph lies in the suggested method of studying the problem 
rather than in the results of the experiments, since a very limited 
number of subjects participated. Part II, involving the use of 
photographic apparatus, contains the records of six subjects; in 
Part I, the number of subjects ranges in the various experiments 
from four to ten. 

In the first experiment, seven arithmetic problems were solved 
by ten graduate students who kept notes on their experiences while 
reading the problems “with special reference to numerals.” These 
observations were organized and recorded in a series of tables. 
The data made it possible to distinguish between a first reading 
and a re-reading in many instances as well as a partial and a 
whole first reading. 

Other experiments in which the method of introspective analysis 
was employed investigated the range of correct recall of numerals 
after the first reading, undertook an analysis of the re-reading of 
numerals in arithmetical problems and studied the successes of 
four subjects in reading numerals in columns. 

Important outcomes of the use of photographic apparatus are: 
“(a) The average number of digits included by a pause on nu- 
merals is decidedly smaller than the average number of letters 
included by a pause on words. (b) The average duration of 
pauses on numerals is greater than the average duration of pauses 
on words in the cases of all subjects. (c) The percentage of re- 
gressive pauses on numerals is greater than the percentage of such 
pauses on words.” 

The final chapter of the monograph contains very pertinent sug- 
gestions for teaching children to read arithmetic problems. Inas- 
much as the reading of numerals requires closer attention on the 
part of the reader than does the reading of words, it is suggested 
that during the first reading pupils be taught to devote their 
entire attention to the conditions of the problem, developing, at 
the same time, skill in the partial reading of the numerals. These 
conclusions are supported by the findings of Gray and Freeman 
in the field of reading. 

Ohio University. R. L. Morton. 
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KNIGHT, FREDERICK B. Qualities Related to Success in Teaching. 
Teachers College, Columbia University Contributions to Educa- 
tion, No. 120, New York City. Pp. x+67. 


When and why is a teacher a successful teacher? Many at- 
tempts have been made to answer this question in terms of such 
general categories as personality, poise, physical qualities, mental 
qualities, moral qualities, disciplinary skill, instructional skill, ete. 
The list can be extended and subdivided almost indefinitely from 
one of the several score cards for teachers now on the market. 
The objection that can be made to the above descriptive terms is 
their generality. Although they are more useful than a “general 
impression” as a guide to getting a useable criterion of teacher 
success, more exact and less subjective measures would be wel- 
comed by the school administrator. The study here under discus- 
sion represents a contribution to scientific education in that it gives 
results and particularly an improved method of attack on this 
problem. 

The first chapter summarizes the studies of Meriam, Elliott, 
and Boyce in this field. Chapters II, III, and IV are concerned 
with the data of the author’s investigation and the relative 
significance of the various traits and qualities studied. The last 
chapter gives a trenchant criticism of Boyce’s study. 

The general procedure of the first part of the investigation 
was as follows. Ratings as to the relative teaching ability of 156 
grade and high school teachers in three different school systems 
were obtained. That is to say, the best teacher was ranked first, 
the next best second, etc., when compared with all the other 
teachers of a given system. Three such rankings on the general 
ability to teach were secured from the supervisors, the teachers 
themselves, and a selected group of pupils respectively. By the 
method of correlation the validity of these rankings, and their 
relation to age, amount of experience, quality of handwriting, 
intelligence as measured by test, major academic interests, normal 
school scholarship, amount of professional study during active 
service, and ability to pass a professional test (“A Trade Test 
for Elementary School Teachers” by Knight and Franzen) was 
established. 

A few of the more significant conclusions may be given. 

“By using the coefficient of partial correlation we find, in the 
case of elementary school teachers, that, the factors of intelli- 
gence and normal school scholarship being constant, there is a 
mutual relationship of +.57 between ability to teach and ability 
to pass a professional test... The amount of professional study 


accomplished during active service is also indicative of success in 
teaching. 
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“In the case of high school teachers, intellectual differences, as 
measured by mental tests, appear to be significant. For the 
selection of high school teachers the use of mental tests would be 
of value. 

“These data, as a whole, may be interpreted to mean that the 
general factor of interest in one’s work becomes the dominant 
factor in determining one’s success in teaching....other measur- 
able traits, either alone or in combinations, are not adequate ex- 
planations of teaching success. 

“This study indicates that age, experience, quality of handwrit- 
ing, intelligence as measured by tests, normal school standing, or 
the expressed interests of teachers are not closely related to suc- 
cess in teaching.” 

In the second part of the study intercorrelations were found 
among the traits teaching ability, ability to understand and man- 
age people, intellectual ability, ability to manage things and 
mechanisms, general scholarship, skill in government or discipline, 
instructional skill, initiative, nervous and physical strength, 
ability to command the respect of pupils and general ability to 
get results, the teachers rating each other on these qualities 
after the connotations of these terms had been explained to them. 
Concerning these data the author concludes that the factor of gen- 
eral spread of estimate operates to invalidate such ratings. Thai 
is, “when a judge attempts to rate a teacher in some particular 
trait, his rating is a defense of his general estimate of that 
teacher, as well as a rating of the trait under consideration.” 

Doubtless the future will see a scientifically accurate and serv- 
iceable measure of teaching ability. The results and methods of 
such studies as this point the way to the solution of the problem. 

Colorado College HERMANN REMMERS. 


WALLIN, J. E. W., The Achievement of Subnormal Children in 
Standardized Educational Tests. Miami University Bulletin, 
Series XX, No. 7. $0.60. 


In this bulletin of ninety-seven pages, Dr. Wallin gives the 
results of some educational tests administered by special class 
teachers to about 300 St. Louis subnormal children. The tests 
used were the Ayers spelling scale, lists A to F and I, L and 0, 
and lists I and II in Starch’s Spelling scale, Gray’s oral reading 
and the spiral arithmetic excersises adapted from the Courtis tests. 

Dr. Wallin states that this “study should have a two-fold sig- 
nificance for educators interested in the study and training of 
feeble-minded and backward children: first, it is the only survey 
thus far made by several standardized educational tests of pupils 
enrolled in special schools for mental defectives; second, most 
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of the pupils tested had been psychologically examined and dif- 
ferentially diagnosed, so that it is possible to analyze the results 
from the educational tests with reference to the children’s in- 
telligence age, to their diagnostic classification, such as imbeciles, 
morons, borderline, backward, visual aphasia cases, etc., and to 
the grade assigned them by the special class teachers.” 

The results of the educational tests show that it is a waste of 
time and money to attempt academic instruction to low grade 
children below four years mentality and that little reading can be 
accomplished by a child under the mental age of six years. On 
the other hand the tests show that subnormal children who may 
develop to about the nine-year mental level should be taken care 
of in the public schools and that although their training should 
a reasonable amount of time should be devoted to academic 
be along the lines of the monumental arts and proper social habits, 
branches. Dr. Wallin is convinced that it is of extreme im- 
portance to take every precaution in placing children in schools 
and institutions for the mentally defective and that their educa- 
tional attainment ability must be considered as well as a mental 
diagnosis made. 


Ohio University. M. LA VINIA WARNER. 


High Schools and Sex Education: A manual of suggestions on 
education as related to sex, Edited by Benjamin C. Gruenberg, 
Ph. D. Prepared under the direction of the Surgeon General, 
U. S. Public Health Service, in co-operation with the U. S. 
Bureau of Education. Washington, D. C., Government Printing 
Office, 1922. Pp. 98. Price $0.50. 


This is a very helpful and practical book for teachers of all 
young people of secondary school age. It makes clear the need 
of better methods of training in matters of sex. In the very 
earliest pages it differentiates sex education from sex informa- 
tion—a point which’ the lay mind seems to have been slow in 
appreciating. Sex information without interpretation of its mean- 
ing to society and the individual and without inspiration to use 
it for the uplift of the race falls far short of its proper place in 
education. This litthe manual has been built up out of the accom- 
ulated experience of thousands of teachers who have attempted to 
handle the delicate subject of sex. It outlines the necessary prepa- 
ration of teachers of sex education, discusses the type of person- 
ality necessary to make such teaching valuable and presents prac- 
tical methods and devices by which sex education may be accom- 
plished through the regular courses in biology, general science, 
physiology, physical education, home economics, and the social sci- 
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ences. Numerous references are cited in order that the reader may 
have an adequate background for more detailed reading. In the brief 
appendices at the end of the book are some very valuable sug- 
gestions on the use of special lectures, exhibits, courses of study 
for teachers, personal hygiene and physical examinations. A very 
comprehensive bibliography of books dealing with sex and sex 
education ends the book. As a manual of what is being done and 
can be done in sex education this book deserves a place in every 
teacher’s library. 
Ohio University. G. H. GRowpon. 


The World Book Company, Yonkers, N. Y., and 2123 Prairie 
Ave., Chicago, Ill., announces the following: 

1. Otis Self-Administering Tests of Mental Ability, a new 
series of tests for grades from four to twelve and for college 
freshmen. The Higher Examination is for grades nine to twelve 
and college freshmen, the Intermediate for grades four to eight. 
Experimental try-outs have been given in preliminary editions. 
Results have given high correlations with teachers’ marks and 
estimates. Four new advantages, it is argued, are to be had in 
these tests they are largely self-administering, very easily and 
quickly scored, afford a flexible time limit, and a wider variety of 
test material. There are two forms, A and B, each of equal 
difficulty. The Directions and Key with each package of 25, price 
$1.00 for Intermediate, $1.10 for Higher Examination. 

2. Pintner-Cunningham Primary Mental Test: This is a new 
group test for the kindergarten, first and second grades and the 
only one on the market suitable for kindergarten pupils. It has 
been tried out in several previous editions. The advantage of 
a test applies to the kindergarten and primary grades are that 
it enables the teacher to classify beginners (with no previous 
school record), and also to discover by comparison of results, the 
degree of everlapping in mental ages. 

The examination consists entirely of pictures. No reading or 
writing is necessary on the part of the pupils. The administering 
is sufficiently simple to require no special training on the part of 
the teacher. 

The examination booklets are sold in packages of twenty-five 
at $1.45 net per package. The postpaid price of a Specimen Set 
is 20 cents. One Manual of Directions, one Class Record, one 
Report to Author and Percentile Graph are included in each 
package. 

3. National Inteiligence Tests, Supplement No. 2. The Com- 
mittee on National Intelligence Tests of the National Research 
Council has prepared Supplement No. 2 to the Manual of Direc- 
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tions for the tests. This supplement contains revised age norms, 
based tpon the examination of 32,372 pupils in nineteen com- 
munities; composition of material for the age norms; data show- 
ing variability of the average mental ability of children from 
city to city; mental age equivalent of scores; revised grade norms, 
with tables showing composition of the materials used, and a 
formula for finding IQ’s and table for converting years and 
months into months. 
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Books and pamphlets for review should be sent to James P, 
Porter, Department of Psychology, Ohio University, Athens, Ohio. 


BosBITT, FRANKLIN. Curriculum-Making in Los Angeles. Sup- 
plementary Educational Monographs, No. 20, June, 1922. 
University of Chicago, Chicago, Ill. 106 pp. 

BoNNER, H. R. Statistics of Teachers’ Colleges and Normal 
Schools. 1919-1920. Department of the Interior, Bulletin No. 
8, 1922. Washington, D. C. 68 pp. 

Consolidation of Schools in Maine and Connecticut. Department 
of the Interior, Rural School Leaflet No. 4, 1922. Washington, 


D. C. 12 pp. 


Cook, KATHERINE M. Supervision of Rural Schools. Department 
of the Interior, Bulletin No. 10, 1922. Washington, D. C. 
111 pp. 


CRAFTS, WILBUR F. That Boy and Girl of Yours—Sociology from 
Viewpoint of the Family. The Baker & Taylor Company, 354 
Fourth Avenue, New York. Price $1.75. 432 pp. 

DEFFENBAUGH, W. S. Administration of Schools in the Smaller 
Cities. Department of the Interior. Bulletin No. 2, 1922. 
75 pp. 

DELGADO, Honorio F. El Dibujo de los Psicopatas. Casilla 1589, 
Lima, Peru. 26 pp. 


Epson, NEWELL W. Status of Sex Education in High Schools. 
Department of the Interior, Bulletin No. 14, 1922. Wash- 


ington, D. C. 12 pp. 


Fitt, A. B. The Human Instincts in Business. The Lothian 
Book Publishing Company Pty., Ltd. The Rialto, 497 Collins 
St., Melbourne, Australia. 

FoOLMER, ANTH AND GERRITSZ, J. Tijdschrift Van Den Neder- 
oe Werkloosheids-raad. Bureau: Singel 530, Amster- 

am. 

GESELL, ARNOLD. Exceptional Children and Public School Policy. 
Yale University Press, New Haven, Conn. 66 pp. 

GIVLER, ROBERT CHENAULT. Psychology—The Science of Human 
Behavior. Harper & Brothers, Publishers, New York. Price 
$3.00. 382 pp. 

Hess, BERTHA Y. Value of the School Census. Department of 


the Interior, City School Leaflet No 3. Washington, D. C. 

HoKe, Roy Epwarp. Improvement of Speed and Accuracy in 
Typewriting. Johns Hopkins University Studies in Educa- 
tion, No. 7. Johns Hopkins Press, Baltimore, Md. Price 75 
cents. 41 pp. 

HOME ECONOMICS IN RURAL SCHOOLS. Department of the Interior. 
Home Economics Circular No. 13, May, 1922. Washington, 
D. C. 15 pp. 

Hoop, WiLLIAM R. Review of Educational Legislation 1919 and 
1920. Department of the Interior, Bulletin No. 13, 1922. 
Washington, D. C. 30 pp. 

JOHN, WALTON C. Statistics of Agricultural and Mechanical 
Colleges for 1919 and 1920. Department of the Interior, 
Bulletin No. 27, 1922. Washington, D. C. 29 pp. 

Junior High School Mathematics. Course of Study and Syllabus. 
Cleveland Board of Education, Cleveland, Ohio. 45 pp. 
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KING, LE Roy ALBert. Status of the Rural Teacher in Penn!- 
venia. Department of the Interior, Bulletin No. 34, 1921. 
Washington, D. C. 86 pp. 

LATHROP, EpiItH A. How Laws Providing for Distribution of 
State School Funds Affect Consolidation. Rural Schoo) 
Teaflet, No. 5. August, 1922, Department of the Interior. 
Washington, D.C. 12 pp. 

LAUGHLIN, HARRY HAMILTON. Eugenical Sterilization in the 
United States. Psychopathic Laboratory of the Municipal 
Court of Chicago. Chicago, I!l. 502 pp. 

LOMBARD, ELL"=N. Home Education by Means of Readi ng Course 
and the Co-operation of State an? National Agencies. De- 
partment of the Interior. Home Education Circular No. 2. 
Washington, D. C. 11 pp. 

List of References on Rural Life and Culture. Library Leaflet No. 
16, Department of the Interior, Washington, D. C. 10 pp. 
PEAR, T. H. *Remembering and Forgetting. E. P. Dutton & Com- 
pany, 685 Fifth Avenue, New York. Price $3.00. 242 pp. 
Professional Education of Teachers in Cleveland. ..Cleveland 

Board of Education, Cleveland, Ohio. 92 pp. 

Record of Current Education Publications. Department of the 
Interior, Bulletin No. 33, 1922. Washington, D. C. 30 pp. 

Reorganization of Home Economics in Secondary Schools. De- 
“ome of the Interior, Bulletin No. 5, 1922. Washinzton, 

38 pp. 

Statistics of Universities, Colleges and Professional Schools 
1919-20. Department of the Interior, Bulletin No. 28, 1922. 
Washington, D. C. 147 pp. 

Surveys of the Department of Instruction, Cleveland Public 
Schools. Cleveland Board of Education, Cleveland, Ohio. 
29 pp. 

THWING, CHARLES FRANKLIN. Higher Education in Australia and 
New Zealand. Department of the Interior, Bulletin No. 25, 
1922. Washington, D. C. 44 pp. 

VALDEZAN, HERMILIO AND DELGADO, Honorio F. Revista de 
Psiquiatria y Disciplinas Conexas. Publication Primestral 
Vol. IV No. 3. Casilla 1589, Lima, Peru. 262 pp. 

J. Harotp. The Twenty-Four Hour School. Educa- 
tional Research Bulletin, Vol. I, No. 6, Pasadena City Schools, 
Pasadena, Calif. 8 pp. 

Winpes, Eustace E. A Plan for the Organization of a County 
System of Agricultural Instruction in Elementary Rural 
Schools. Department of the Interior. Rural School Leaflet 
No. 6. Washington, D. C. 8 pp. 

Zook, GeorcE F. National Conference of Juirior Colleges, 1920. 
Department of my Interior, Bulletin No. 19, 1922. Washing- 
ton, D. C. 73 p 

Zook, GEORGE F. , le on the Higher Educational Institutions 
of Arkansas. Department of the Interior, Bulletin No. 7, 

1922. Washington, D.C. 18 pp. 
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